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If we take in our hand any volume, let us ask, 

Does it contain апу abstract reasoning concern- 

ing quantity or number? No. Does it contain 

апу experimental reasoning concerning matter « == 
of fact and existence? No. Commit it then to 

the flames; for it can contain nothing but 

sophistry and illusion.— uw. 
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FOREWORD 


vanced by new techniques and new instruments, as well as Бу? 


Г psychology, as in other sciences, knowledge has been ad- 


Anew modes of thought, for ‘the bare hand and the understand- 
Hig "еб око itself have but little power; results are produced Бу 
instruments and helps’. Whether the methods so cogently set out 
in this book will have great potency in the study of human person- 
ality is yet uncertain: the subject is elusive and the past failures, as 
the author points out, many. But Or. Eysenck's enterprise has now 
come a long way, in a rather short time, and even those who dis- 
trust statistical procedures and statisticians’ reasoning, or,who find 
some of Dr. Eysenck’s conclusions at odds with their experience, 


. will concede his two major claims: he has used methods of observa- 
tion and analysis which are well defined and easily reproduced by . 


Other investigators, who can thus test the correctness of his find- 
Ings; and there is immense gain to be had from a taxonomy which 
Permits the cardinal attributes of each type to be measured, 
Whether the types are of mental disease, of personality, or of 
general constitution. The more accurate the measurement and 
the more distinct the type, the greater the security which such a 
classification affords for further scientific inquiry. There are per- 
haps other advantages. If the types discerned show any constant 
relation with the course and pathology of disease, then they will 
* speedily enthroned, because of their value in clinical practice. 
Ut it is not on this ground that Dr. Eysenc s 
emphasizes rather the scientific and heuristic value of his research. 
€ takes no obvious steps in his book to reconcile any complacent 
Psychiatrists to what might seem a root and branch attack upon 
their beliefs and methods. Psychiatrists, however, are not com- 
Placent unless they are stupid; and Dr. Eysenck is by no means 
lind to the worth of clinical knowledge: clinical categories pro- 
Vide the indispensable starting-point for his irvestigation. His 
argument for an oscillating use, turn and turn about, of criterion 
and factor, clinical assessment and distillate of test-results, com- 
уп 


k takes his stand; he ( 
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mendis itself to the psychiatrist, well aware that current methods of 
diagnosis and appraisal, though less shaky than Dr. Eysenck be- 
lieves, have signal weaknesses; they are the weaknesses to which 


Jaspers drew critical attention thirty years ago and which were 
openly reckoned with in the 


last edition of Kraepelin's great 
textbook. 

The debt of psychiatry to the psychologist is now great and 
growing. Fortunately the indebtedness seems mutual, and the 
association between the two fields of study most profitably inti- 
mate. From these rigorous inquiries, sustained and resourcefully 
developed over years, psychiatry stands to gain an impetus and 
accuracy in some essential matters which will advance йай 
reinforce the free play of clinical skill and insight. 

й AUBREY LEWIS 
Professor of Psychiatry 
University of London : 
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INTRODUCTION 


the results of experiments carried out at the Institute of 

Psychiatry (Maudsley and Bethlem Royal Hospitals) in the 
University of London. Where the other book dealt almost exclu- 
Sively with adult neurotics, we have now extended our scope, 
and the present report deals with normal and psychotic adults, as 
Welt aj with neurotics, and with children as well as with adults. 
The aim has remained the same, namely ‘to discover the main 
dimensions of personality, and to define them operationally, i.e. 
by means of strictly experimental, quantitative procedures’. An 
сар has been made to advance more rigorous proofs for the 
heuristic hypotheses outlined in the previous work, and to make 
deductions, and test these hypotheses, in settings other than a 
mental hospital—among factory workers, nurses, students,teachers, 
and other normal groups. An attempt has also been made to 
indicate the practical usefulness of the methods outlined by employ- 
ing them in studies of the effectiveness of occupational selection, 
both in the normal and the abnornial field, of the effectiveness of 
Surgical operations in the асу он of mental illness, and of the 
factors determining vocational adjustment. Throughout the book 
methods of statistical analysis and proof have been used which 
follow the time-honoured hypothetico-deductive method 5 this 
las occasionally necessitated the introduction of novel techniques, 
Such as the method of criterion analysis. At all times, an attempt 
has been made to imbricate closely the psychological theory which 
Underlies this work and the experimental procedures adopted. 
“шайу, I have tried to eschew that meretricious sesquipedalian- 
sm which threatens to turn psychology from a scientific disci- 
Рїїпе into a stamping-ground for experts in semantics and verbal 
Magic, 

Much of the material here reported was originally delivered in 
е Ё orm of a series of open lectures at the University of Pennsyl- 
Lum during my tenure there of a Visiting Professorship. Other 
Parts of the book were delivered as lectures at the E.T.5. Invita- 
Чопа] Conference in New York, at the 1950 Psychiatric Congress 


i k > 
п Сорепһареп, at a Conference on ‘Current Trends in Psychology 
8.5.р А 1 Я 
Я - 1 


Te: book is a sequel to Dimensions of Personality, and reports 
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in Pittsburgh, at the International Congress of Anthropology m 
Paris, at the 1949 A.P.A. meeting in Denver, at the International 
Psychological Congresses in Edinburgh and Stockholm, at various 
meetings of the B.P.S., and at a number of American Universities 
from Canada to the South, and from California to New England, 
all of whom kindly gave me facilities to put before thcir students 
certain aspects of the methods used and the results achieved. The 
response of these very varied audiences, ranging from lay through 
undergraduate to graduate and professional levels, and having 
orientations ranging from the clinical and psychiatric to the 
mathematical and statistical, has taught me a great deal about 
possible lines of criticism and areas of difficulty. But above +1 the; 
have taught me that an essential part of a book of this kind must 
be a clear statement of the aim and purpose which has motivated 
the investigator. It soon became clear to me that many who on the 
whole were sympathetic to the general tenor of the work were a 
little puzzled as to its main import. This difficulty of ‘placing’ was 
50 universal that I have ventured to include an opening chapter in 
which I have tried to relate the accepted principles of scientific 
methodology to the experimental study of personality, and to 
indicate why in my opinion the areas investigated are important, 
and why. the method used was thought superior to others more 
familiar. This was considered all the more vital in view of the 
wide divergence of aims which is so characteristic of the many 
different types of professiona! workers who are interested in human 
personality. 

Several convictions which I held b 
the research reported here 


rather than аза store-house of revealed truth 
ledge шгу save t 


Searching, and Iraq hi i e. AE ni 
his objective. T 
the Psychologist does a. 
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may be a short-sighted attitude. In science, coming events do not 
always cast their shadows before them, and the growth of a 
genuinely scientific discipline of personality study may be the out- 
come of much that appears at first sight unpromising and irrelevant. 
Both sides have much to learn from each other, and our knowledge 
of human nature will advance more securely and more quickly by 
a pooling of knowledge, experience, and methodology. 

The second conviction, which lay already at the basis of the 
work reported in Dimensions of Personality, is that research in this 
area, in order to be fruitful, has to be what Marquis (1948) has 
called ‘programme research’. In other words, research has to be 
plaited for a whole department around a common objective, it 
has to be continued over a period of years, and the research 
activities of students and teachers alike have to be integrated into 
it. Only in this way can one hope to avoid the inconclusive out- 
come which all too often characterizes the individual short-term 
research. 

Individual students, of course, must be free to choose whether 
they want to take part in the departmental research, or whether 
they want to strike out for themselves; it is our experience that the 
Majority welcome the opportunity of making an active contribu- 
tion to the development of a programme which shows promise of 
furthering our knowledge of human fiature. Such training will also 
Prepare the student for the type of team research which is becoming 
more and more prevalent in the social field, where specialists in 
many disciplines must collaborate in order to solve a problem 
common to them all. 

The third conviction, elaborated in various places throughout 
this book, relates to the use of different types of personality test. 
hile questionnaires and so-called ‘projective’ tests constitute 
Most the only types of test used by ¢ s і : 
research эй; Db investigations described in Dimensions of 

ersonality and here have more and more confirmed my belief ү 
Progress in the scientific study of personality is woe wae i 
UP with what may be called ‘objective behaviour tests’ ( ye 5 
1950a).'These appear to tap more fundamental layers of personality 
than do other types of test. If we deduce personality from рели 

25 surely we must if we want to stay on solid ground) then the 
argument from one item of behaviour to another seems mor 
cogent than the argument from what a person says about his 


al 


i r 
*haviour, or the inference from what he says about a picture or. 


linical psychologists and | 
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an ink blot, to what he actually does. This belief, of ee 
subject to empirical rom 2 15 given here as an heur 
hesis, not as a statement of fact. | 

E last conviction relates to the importance of ши 
work, of finding the dimensions along which we want to d 
before we proceed to carry out any measurement. At the pr F A 
moment, psychology and psychiatry are still in the pre-class a: 
tory stage—älthough some optimists appear to think that ie 
disciplines have already ‘outgrown’ this stage. Attendance г : 
psychiatric case conferences inevitably calls to mind Punch’s com 


hanimal is a bloomin’ hinsect.? 
the systems of classification now in use, 
tically of little value, theoretically in 
contradictory. Yet, for 
in the book, I cannot 


ting concepts and proce- 
alidity, and treasure those which 
othetic-deductive method. It is as 


а report on the first Stages of such an endeavour that these pages 


should be read. 
least based on Є 
than sound, they are at leas 
the conceptual fi 
able of growth and change. If this book 
spur on others to devise better methods er 
value, and arrive at more widely relevant concepts, I shall be 
amply repaid for tay labour in writing it. 


completion of a communal 


book, is the Opportunity of 
expressing one’s thanks to those whose help contributed so largely 
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towards w jet : 
This E success it may be deemed to have achieved. 
ants, students m present and former colleagues, research assist- 
as well as £e doni өсү of the clerical and computing staffs, 
Criticism and sm i E SE the United Stirn 

and г 7 3 me greatly in formulating 
аде E x a more clearly. Special malne A 
tomy and Харх! done by A. Petrie on the after-effects of leuco- 
А. Petrie on stud реа problems; by Н. d Himmelweit and 
in children: b р and the measurement of neuroticism 
of mental defi dn Connor and J. Tizard on the employability 
“ОгаЁ-сурс: M by F. Goldmann-Eisler on the etiology of the 
neuroticism: |, е" d on the relation of ‘sense of humour’ to 
gestalt laws a y SUN ranger on thc applicability of atomistic and 
(now Mrs сүнеп judgments; by А. Clark and A. Gravely 
Personality; к perceptual and motor tests аз measures of 
Word Gonne T ‚ Grown оп after-effects of leucotomy, and on the 
the нийл ж List аз a measure of personality; by D. Prell on 
subjective энэ of neuroticism; by M. Israel and D. Vinson on the 
tion; by A. Hero the objective method of Rorschach interpreta- 
unskilled E be nae on the objective measurement of personality in 
by A. Mea (ырь аѕ related to work adjustment and productivity; 
on the validit 5 28 the reliability of the Rorschach test; by S. Сох 
tion of ис нх of the Rorschach tests by A. Lubin on the applica- 
y J. Mees кш function techniques to psychological problems; 
Бу F, г ы е assumptions underlying Holzinger's h* statistic; 
: Yukviss xS the Character Interpretation Test, and by 
researches, . Malloy who carried out the testing for various 

С А 

diene were carried out by th 
ay, ойы Department, under the direc 
Carried он Or about one year, A. Jonckhecre. 
ingleton: s burden were D. Burgess, ! 
ollerith мэх atter also bore the main responsi 
and vob зена contributed greatly by his advice on apparatus, 
Workshop E due to him and to J. Westhenry and W. Whithers, 
ieee on whom fell the burden of constructing 
connection, ning apparatus. J. Standen was also-helpful in this 
the е now Mrs. McIntosh, typed th 
3, and assumed responsibility for t 


е statistical section of the 
tion of A. Lubin, J. 
The computors who 
I. Wiltshire, and J. 
bility for the 


e manuscript, checked 
he bibliography; her 


= e 
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uick and accurate work considerably lightened the load of 
authorship. D. Webb drew the figures with skill and competence. 
F. Loos and S. Jeff took the photographs, which were posed by 
various members of the Psychological Department. 

Professor P. E. Vernon read the manuscript and saved the 
reader from a good measure of repetition and obscurity; the writer 
can hardly refrain from accepting full responsibility for what has 
remained of these twin faults. 

«Professor Aubrey Lewis gave constant support to the research 
programme of the Department, and it is no exaggeration to say 
that without him this work could not have been completed. Many 
other colleagues and friends at the Bethlem and Maudsley: Hos: 
pitals contributed of their time and advice, and lent their support 
to our plans; they are too numerous to name here. 

My wife contributed directly to this research by her great skill 
in testing difficult and sometimes hostile patients; this, however, is 
not the only reason why this book is most fittingly dedicated to her. 


t 


Institute of Psychiatry 
Maudsley and Bethlem Royal Hospitals 
University of London 


H. J. EYSENCK 
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Chapter Опе 


SCIENCE AND PERSONALITY 


HE word ‘science’ has great prestige value in our society. 
Like other words having great prestige value it is used in 
„~ many contexts where one may legitimately doubt whether 
thd 86 who use it mean by it what philosophers, logicians, an 
scientists themselves have agreed upon as the correct. usage. 
Scientific palmistry', ‘Christian science’, ‘scientific marriage 
Counselling’, and ‘principles of scientific Marxism’ are-catchwords 
Which use an honoured term as.a bait; their relation to science is. 
at best tangential and usually non-existent. In the examples given, 
the deception will be obvious to most readers; in other cases We 
May not be so sure, and some kind of criterion may be required to 
Judge accurately. If we look at the flood of articles dealing with 
Personality’, ‘clinical psychology’, ‘counselling’, ‘psychoanalysis’, 
lagnostic testing’, ‘projective techniques’, and similar inter- 
д concepts, and attempt to judge their value from the point 
ОЁ view of science, we may be permitted at least a doubt as o 
Whether all these contributions can Бе called genuinely "scientific 
"indeed, in а more jaundiced mood we may wonder if any of 
them deserve this title. t 
Tuch argument on this point has suffered from a lack o 
definition of the term ‘scientific’, both parties to the argument 


Using this word in different senses. Finding their contributions 
many workers in 


d the meaning of 


Md emptied of precisely those € 

wo о this esteem. This procedure 0 a e 

E Eds is familiar in the political ficld, where words 

а: €mocracy’, which have relatively clear meaning and content, 

fo, к ed by some who wish to retain the prestigg Value of the term 
T régimes which clearly do not fall under this heading, to mean 


апове Precisely the opposite to what the word has hitherto stood 
т. Semantic manceuvres of this type create much confusion and. 
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necessitate a careful analysis of the meaning of the words used. 
They are illustrative of the Alice-in-Wonderland atmosphere 
which pervades so much of this field. ‘When I use a word', said 
Humpty-Dumpty, ‘it means just what I choose it to mean— 
neither more nor less.’ ‘The question is,’ said Alice, ‘whether you 
can make words mean different things.’ "The question is, said 
Humpty-Dumpty, ‘which is to be Master—that’s all? 

There are clearly two kinds of psychology, just as there are two 
kinds of physics. Eddington (1928) has contrasted these two types 
of physics by writing about his two writing tables—the sensible 
table, as perceived by him, and as accepted by common sense, 
solid and impenetrable—and the table as modern physics ves it, 
being constituted mainly of empty space, with a very large number 
of very small electrons, protons, neutrons and other bodies and 
electric charges moving in this space. Absurd as this second con- 
ception of the table may sound tothe naive observer, we are willing 
to accept it because by accepting it, and all the concepts and laws 
which are.involved in it, we are able to bring order into the great 
mass of facts which the world presents to us, and because predic- 
tions made on this basis have a habit of being borne out when we 
apply an empirical check. Physics started from the actual table as 
presented to sense experience, but it has left it a very long way 
behind. In doing so, it has ао lost a good deal of the feeling of 
certainty which attaches to common-sense observation, replacing 
it by an attitude of tentative belief in the relative accuracy of 
current hypotheses. 1 

In a similar way, psychology starts out with the observations 
of common sense. In certain limited fields, it has succeeded in 


1 This opposition between science and common sense is well b 
Galilco in his famous Dialogues (1661): rought out by 


Ч cannot sufficiently admire the emin 
received and held it to be true 


Einstein recently emphasized a similar point: 
"Advances in scientific knowle 
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getting beyond common sense, and in elaborating concepts and 
laws which bring order into a relatively small field and which 
enable us to make moderately accurate predictions. In other fields, 
it is still painfully close to common sense and has failed signally to 
elaborate concepts and laws of its own. In most fields, it shows a 
stubborn tendency to cling to feelings of certainty and to refuse to 
hold tentative beliefs, and to regard current hypotheses as at best 
very rough-and-ready approximations to those likely in the long 
run to be found most useful. 

In contrasting common-sense observation and scientific 
theory, we do not mean to imply that common sense is invariably 
wronge«ad science invariably right. The difference lies essentially 
in the method by means of which one’s opinion is arrived at. If this 
method follows the dictates of-science, the result may be wrong, 
but it will be a wrongness that is self-correcting. If the method is 
not that of science the result may he right, but we have no way of 
knowing that it is right. Ifit is wrong, we have no way of correcting 
it. Many firmly held beliefs of common sense are wrong, although 
they may superficially have much to recommend them. Others 
may be right or wrong; we have no evidence to decide. 

The two kinds of psychology which we have distinguished— 
that of common sense and that of science—do not differ only from 
the point of view of method; they also seem to differ from the point 
of view of aim. Common-sense psychology secks to understand. If 
another person acts in the same way that I would act, then in some 
way I may say that I understand his actions. If he acts in a way 
different to the way in which I would act, but nevertheless 
familiar to me because I have seen many others behave in this 
fashion, I still feel that I understand him. If his action is un- 
familiar to me, I may try to understand it by reducing itto familiar 
motives experienced in unusual circumstances. Much of psychiatry 
consists in such an attempt at understanding, and it is often stated 
that the good psychiatrist or psychologist must possess empathy 
which enables him to ‘feel himself into’ his patient. 

The aim of science is quite different, and rather more austere. 
Mach (1942), Ostwald (1911), Pearson (1911), and many other 
scientists have maintained that the only real world ва sensible 
world, and that scientific theories are merely descriptions of the 
sensible world. In other words, science aims at description. This 
dictum is easily misunderstood. ‘A complete description of e 
phenomena is impossible, and if it were not impossible it would'be _ 


To F SCIENCE AND PERSONALITY 
useless, for the aim of science is to make the primary data in- 
telligible by exhibiting their mode of connection. Without abstrac- 
tion this would be impossible but a complete description would not 
‘permit of abstraction’ (Stebbing, 1930). Obviously, then, descrip- 
tion may be understood in two senses. We may refer to description 
-atan elementary level, as when we describe minutely the charac- 
teristics of a given table, or a blade of grass, or a whale. Or we 
may refer to description at a higher level, as when we describe the 
movements of the planets in terms of parabolas, or the behaviour 
of electric particles in terms of field theory. This latter type of 
description is often called ‘explanation’, because by referring the 
more elementary, individual phenomena observed at the uw level 
of description to the laws which are abstracted from these cle- 
mentary facts, and which constitute the higher level of description, 
we often feel that we have ‘explained’ the observed facts. All we 
have done, of course, is to giye the individual fact a place in a 
unified, consistent system of description; more than this science 
does not attempt to do. If it is clearly understood that the term 
‘explanation’ does not carry any overtones of intuitive or empathic 
understanding, no anthropomorphic ‘feeling oneself into’ things, 
but stands merely for the more abstract level of description, there 


is probably no great danger in using that term, and it will be so 
used in the remainder of tkis chapter. 
Science, then, tries to describe and 
follows certain rules which ex 
sable to the development of 


to explain. In doing so it 
perience has showr to be indispen- 


descriptions so comprehensive as to 
€ a Н 
deserve the name ‘explanation’, Let us start with an example con- 


trasting non-scientific and scientific explanation, using planetary 
motion as the problem to be explained. ‘The ancient Egyptians, 
starting from the assumption that the universe is like a large box 
of which the earth forms the bottom and the sky the top, supposed 
the stars to be lamps cither carried by gods or suspended by cords 
from the top of the box. The sun was supposed to be a god, Ra, 


carried daily in a boat along a river of which the Nile was a branch. 
He was born every morning. 


ў > Brew in strength until noon, when he 
turned into another boat. Finally, he was carried in another boat 
during the night back to the east. The eclipse of the sun was 
accounted for by the Supposition that a huge serpent sometimes 
attacked the boat, A simil 
eclipses and the phases of th 
Из enemies. A sow attacks 
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and after a fortnight's agony and increasing pallor, the moon dies 
and is born again. Sometimes the sow manages to swallow it. 
altogether for a time, causing a lunar eclipse. "WEST ac 

_ ‘Although these suppositions do not involve flagrant contradic- 
tion of the facts upon which they are based and although they are 
the outcome of a desire to explain these facts, "nevertheless, this — 
Egyptian doctrine is in no sense scientific. Its failure to be scientific 
is not due to the fact that it rests upon unproved assumptions. 
Every scientific theory rests ultimately on such assumptions. It 
fails because these assumptions were of such a kind that there could 
be no further evidence in support of them. They were essentially 
unverifiable; they were not susceptible of development; they did 
not suggest the deduction of observable data. . . . The Egyptian 
theory, therefore, could neither be developed nor tested. But a 
theory is scientific only if it admits of testing and development’ 
(Dreyer, 1906). 

As an example of scientific explanation, we may take Newton’s 
law of gravitation. Included in his descriptive scheme are all the 
facts which enter into Ptolemy’s mathematical description, into 
Copernicus’ heliocentric scheme, and into Kepler’s three laws of 
planetary motion. All these facts were not merely fitted into 
Newton’s system; they could be deduced from it. And once the law 
had been enunciated, such apparently independent and discon- 
nected facts às the phenomenon of the tides, planetary motions, 
the precession of the equinoxes, the phenomenon of weight, the 
motions of cyclones, and the orbit of the comet could all be ex- 
plained in terms of the inverse square law of universal attraction. 

‘Such deductive power is a mark of constructive description. 


. .. By its comprehensiveness, by its deliberateness in leading to 


new discoveries, in suggesting fresh experiences, and in connecting 


what was hitherto disconnected, it achieves the aim of science 1n 


making the multiplicity of facts intelligible. Such a theory may be 
said to explain the sensible facts that constitute its data since it 1s 
сайз deductively back to them. 


not only based upon them but leads i 
ther sensible facts which were not 


Further... it leads on to 0 s > : 
known and thus do not form part of the original data (Stebbing, 
1930). 


f co-ordination achieved. by a,theory, 
by a small discrepancy. A construc- 
possesses a high degree 
t to a comprehensive 


. The greater the degree o 
the more it is liable to be upset 
üve description, such as Newton's theory, 
of co-ordination; it is coherent with respec 
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range of facts. A single fact which is in contradiction to the 
hypothesis may necessitate its rejection. Occasionally the theory 
may be salvaged through the use of an ad hoc hypothesis which 
accounts for the discrepant fact, and which can later on be verified. 
Thus, for instance, an irregularity in the orbit of the planet 
Uranus was accounted for by the ad hoc hypothesis that another, 

_ hitherto unknown planet, was responsible for these aberrations. 
This planet had to have certain qualities which could be specified 
precisely in accordance with Newton’s laws, and when its position 
had been calculated on the basis of the theory, Neptune was 
actually discovered in the precise place required according to 
theory. Another irregularity in planetary motion, namely;-that in 
the orbit of Mercury, was also explained by le Verrier on the basis 
of a similar ad hoc hypothesis, involving an interior planet which 
he called Vulcan. Careful observation failed to revcal any such 
planet, and the hypothesis was, discredited. 

‘A multiplication of ad hoc hypotheses is contrary to the aim of 
science which seeks the greatest possible co-ordination of facts. 
Every additional ad hoc hypothesis marks a breakdown of the co- 
ordination. . . . An ad hoc hypothesis necessarily explains a given 
discrepant fact since it has been introduced solely for that purpose. 
If we can appeal to an ad hec hypothesis whenever there is a 
discrepancy between theory: and fact, then we can explain every- 
thing and foretell nothing. It is only permissible to introduce an 
ad hoc hypothesis when consequences can be deduced from it which 
are later verified. Otherwise an alternative method for dealing 
with a fact discrepant with a theory has to be resorted to, namely, 
the rejection of the fundamental assumptions upon which the 
theory is based’ (Stebbing, 1930). 

This is the course followed by Einstein in his attempts to 
account for the negative result of the Michelson-Morley experi- 
ment to detect the earth’s velocity relative to the ether. The experi- 
ment showed that contrary to Newton’s laws the motion of the 
earth has no influence upon the propagation of light over the 
earth’s surface. While this discrepancy is minute, it nevertheless 

ypotheses, or a rejection 
the success and immense 
tion it had to be rejected 
less it were saved by the 
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gests an explanation of the discrepancy which is of a kind already 
familiar, then the hypothesis is not objectionable. It does not con- 
flict with the aim of science. When, however, it is essentially un- 
verifiable since it suggests what transcends observation, or appeals 
to effects which neutralize each other (as was the case with the 
Lorentz-Fitzgerald contraction suggested to account for the 
Michelson-Morley experiment), then it is not admissible. 

_ It is an interesting aspect of many modern theories of person- 
ality (particularly the analytic and so-called dynamic ones) that 
they capitalize precisely on these two inadmissible and unverifiable 
types of ad hoc hypotheses. The concept of reaction-formation, for 
instan&, implies that cause A may result in behaviour-pattern X 
as easily as in non-Y—and presumably anything in between. One 
dynamic may cancel out another. Prediction is thus impossible, 
and indeed hardly ever attempted. ‘Verification’ consists not in 
stating an hypothesis, making deductions, and then seeking 
evidence regarding these deductions; it consists in first obtaining 
the evidence and then seeking an explanation, which is invariably 
found through the invocation of unverifiable ad hoc hypotheses. It 
can hardly be argued that this type of procedure is in line with the 
precepts of science, as ordinarily understood. 
T То quote but one example from the rese 
illustrate this tendency, we may take Symonds’ book on Adolescent 
Fantasy (1949). In it he gives a set of correlations between teachers’ 
ratings of forty children studied in grea detail, and their ‘fantasy 
themes’ as determined from the Thematic Apperception test. One 
hundred and seventy-five such correlations are given on page 368 
of the book; they have been plotted in Figure 1 below. It will be 
Seen that they cluster around zero, and that in view of the high 
sampling errors involved in the calculation of correlations from 
small numbers of cases these figures do not in any way deviate 
from a kind of distribution expected on the basis of the null 
hypothesis, i.c. on the basis of the hypothesis that there is no 
Correlation whatever between rating and T.A.T. score. This is the 
only conclusion which can legitimately be drawn from these data, 
апа in view of the far-reaching claims which have been made for 
this ‘test’, it is one which is of great interest and ao, hen is 
a dug e t a the S miet 

mpirical figures from chance expe Я 
7 = 0 is -16 for forty cases, 50 that one would expect 68 per cent 
ОГ all observed correlations to lie within the limits ef + 16 and , 


arch literature to 


Я : : 
~~- ‘These consistencies and regularities . . . fit i 
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— ‘16. Actually the number of these effectively zero correlations 
is slightly larger than this, so that if anything the ratings and the 
T.A.T. scores are correlated to an even slighter extent than would 
be expected on a purely chance basis, However, the correlations 
are not independent from each other, so that no exact test can be 
made; such tests apply only to correlations which are themselves 
uncorrelated. This lack of independence also invalidates Symonds’ 


-50 


— Figure r.—Distribution of correlations between Thematic Appreciation 


” й "Test scores and teachers? ratings 


rm 


argument that ‘there is a certain consistency in these relationships 
et that makes one believe that they have a higher degrec of 
validity than the statistical probability of departing from chance 
would permit one to assume. Variables of opposite meaning will 
have relationships 9n opposite sides of the correlation zero point.’ 
Such consistency is in no way proof of the extra-chance character 
of sets of Correlations; it can be deduced from the laws of prob- 
ability directly, f a 


. This brings us to the last argument advanted by Symonds. 


n too. well with the 
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c ed of the data to attribute them entirely to chance.’ They 
re "verified by their dynamic significance". As an example 
Symonds quotes the fact that ratings of ‘good adjustment’ correlate 
Positively with fantasies of anxiety. This is interpreted as meaning 
that ‘the good adjustment was a reaction formation which kept the 
individual from recognizing his anxiety and gave him a way of 
defending himself against it’. Those with poor adjustment, it is 
claimed, ‘did not show anxiety in their stories, because the anxiet 
expressed itself symptomatically in their poor adjustment to life’. 
Here we have the full flavour of the type of argument char- 


acterized as unscientific in our preceding discussion. A correlation 
of anxiety’ 


D founa, between ‘good adjustment’ and ‘fantasies о! 
Which is only just larger than its own standard error. Ifthis correla- 
tion had been in the opposite direction, it might have been posible /.— 
to argue that it was in the expected direction, and that there is : 
nothing inherently absurd about maladjusted children having 
fantasies of anxicty. The fact that the correlation is in the wrong. 
direction, however, does not worry the ‘dynamic’ psychologist; 
indeed, it is expressly cited because it is ‘verified’ by its ‘dynamic’ am 
significance’. All this is accomplished through that ready hand- ` 
maiden, ‘reaction formation’. The author is not concerned about — - 
the fact that in other projection tests, such as the Sentence Com- 3 
pletion Test, there is a significant and nagative correlation between 
adjustment arid fantasies of anxiety; his ‘explanation’ is a purely - 
ad hoc one which,can of course be rclied apon to ‘explain’ the facts 
which it was constructed specifically to account for. Nor is йел: 
author worried by the fact that he is forced to make assumptions : 

of such magnitude as would make less ‘dynamic’ writers wince— 
assumptions, for instance, that good adjustment is а defence. 
against anxiety! When it is rcalized that all this elaborate appar- - 
atus is set in motion, and all these tremendous theories ane —. 
assumptions put forward, to explain a correlation which is com- 
pletely insignificant, it will be apparent why such procedures are 
considered antagonistic to the spirit of science, and unlikely to 


advance our knowledge of human behaviour. A > : 
ur knowleage ts to the study of personality, ч 


When we їг ly these prece 
We find НӨ ЧИН нае hal Хүн, for itself a dangerous 
Course, threatened by the Scylla of nebulous Freudian general- 
izing, and by the Charybdis of pointlessly amassing unrclated- 
facts, Thus we have two great groups of workers whose efforts 
bring extraordinarily little return for the labour expended: On the | 
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one hand the Nco-Freudian, Crypto-Jungian group of “intuition- - 


ists’, who scorn the pedestrian methods of scientific verification, 
who base their conclusions on very small biased samples, and 
whose alleged ‘facts’ are themselves only interpretations according 
to very dubious canons of evidence; on the other, the mole-like 
calculators of innumerable correlations between measures taken 
without any theory or hypothesis capable of proof or disproof, who 
mistake the collection of a thousand unrclated “ас1$? for science, 
who seek to atone for the scientific barrenness of their results by 
stressing the mathematical be 


and who seem incapable of seeing the wood for the trees. In other 


words, there appears to be a definite schizophrenic interlude in the 
development of a scientific psychology of personality; the building 
up of hypotheses and theories is divorced from the patient collec- 
tion of facts. ‘Dynamic’ psychology is almost entirely devoted to 
an exploitation of ad hoc hypotheses which are either unverifiable 
or appeal to effects which neutralize each other; ‘actuarial’ psycho- 
logy is almost entirely devoted to statistical analysis without 
benefit of theory or hypothesis. It is small wonder that neither 
separately nor conjointly do these two ‘schools’ succeed in con- 


structing a firm, scientific foundation on which to build. What is 


needed in psychology, as in any other science, is greater under- 
standing of, and more int 


2nsive use of, the hypothetic-deductive 
method, in which a clear, unambiguous hypothesis is stated, 
deductions, preferably cf a quantitative kind, are made, and 
experiments performed to verify or disprove the hypothesis. 
( i may summarize our discussion so far by quoting Stebbing 
1930): 


"Scientific thinkin 
essentially methodic 
Successful, issues in 
hended as fragment 
Organization is achi 


say that a set of elements 
©з à system when, 


a whole, or system 
f bee Day: : exhibits order and there 
ore becom given the properties of some of thë 
f the set, the prope 


rties of other members of the set, 07 
some of them, аге thereby determined. This determina- 


auty and purity of the methods used, ү 


Бий. 
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tion is due to the relation that orders the set; it is not a property of 
the elements regarded as a class. 

‘Therelevance of the subject of order to systematic inquiry may 
be regarded as evident. Scientific thinking essentially consists in 
the organization, or co-ordination, of the facts with which it deals. 
A collection of disconnected facts does not constitute a science any 
more than a mob of men constitutes a college, or a heap of bricks 
а building. The sciences are concerned not with facts as such but 
With ordered facts. One main difference between the earlier and 
the later stages of science is to be found in the growing prominence 
of order. What is ultimately important for science is the type of 
order rather than the elements that are ordered. It is for this 
reason that the most highly developed science; theoretical physics, 
appears to be concerned with.something far removed from what 
the plain man would regard as a fact. Science has its origin in the 
attempt to co-ordinate the facts of sensible experience but in the 
discovery of the appropriate type of order it passes to the considera- 
Чоп of a different order of facts. А k 

‘Ultimately, the explanation of facts is to be found in their 
organization into a system. Thus, a mere collection of observable 
facts does not suffice to constitute a science. There is required ш 
addition a certain kind of attitude to the facts and a certain kind 
of predominantly logical method. Theeequirements of this method 
determine whether a given observable fact is a possible datum for 
Science. The scientists, that is to say, select those facts that can be 
treated in accordance with this method. We have then, two char- 
acteristics that belong to science; the selection of a certain kind of | 
facts and the use of a certain kind of method. The scientist is er 
cerned with the correlations of sets of properties. Hence, а scientif с 
Proposition is ultimately of the form: Whatever has the шин 

Ва їе property Y. А scientist reaches such a proposition Ex 
Propositions of the form: This A, this B, this С, gia ae Бос 
Perties X and Y. That is to say, he кен СО цай 
18 Scientific method. The scientist, then, considers particular facts 
only in order to obtain generalizations of increasing abstractness. 
thi observation involves abstraction, that is, selection from some- 

ing that is also there to be observed. ae 28 

“The method of science sometimes called the inductive ШЕШ 
has four essential stages. The first of these 15 constituted by the 


awareness of a familiar complex situation in which some Be fact 
8.8.Р, 


. plain a fact when 
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is apprehended as peculiar, and where the need is felt to account 
for this fact by connecting it with the total situation in which its 
occurrence would not be unexpected. 

"The second stage is the formation of an hypothesis which 
would connect the unexpected fact with other facts. 

‘Third comes the deductive development of the hypothesis, and 

esting of the consequences deduced from the 
hypothesis by appealing to observable facts, 

“With regard to such an investigation we may be said to ex- 

we have shown that it is connected in an orderly 
manner with other facts that do not have to be explained.’ 

This general point of view js opposed by those who hold what 
is often called the idiographic as contrasted with the nomothetic 
point of view. These terms, coined by Windelband (1921) and 
introduced into Anglo-Saxon psychology by Allport (1937), refer 
to attempts to ‘understand some particular event in nature or 
society’ as opposed to the seeking of ‘general laws and employing 
only those procedures admitted by the exact sciences’, The main 
battle-cry of the idiographically minded 
"uniqueness of the individual’, * 


(Allport, 1937). 
The idiographic vi 
logists who have t 


Orrow'. То the 
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three quantitative variables, hue, tint, and chroma. A combination 
of perfectly general, descriptive variables is sufficient to allow any 
individual to be differentiated from any other, by specifying his 
position on each of these variables in a quantitative form. Many 
writers seem unable to see that one individual can differ quantita- 
tively from another in many variables, common variables though 
they may be, and still have.a unique personality? (Guilford, 1936). 
Quite on the contrary, the very notion of ‘being different from’ 


implies at the same time the idea of direction and the idea of ~ 


amount—in other words, the unique individuals cannot meaning- - 
fully be said to be different from each other unless they are being 
compared along some quantitative variable. Uniqueness, there- 
fore, is not in any sense a concept antagonistic to science; it 
follows from the methods used in science to describe individual 
events in terms of common variables. 

The second point relates to the importance attributed to the 
unique. ‘Science is not interested in the unique event; the unique 
belongs to history, not to science.’ Yet clearly applied science must 
deal with the unique, with the individual. We cannot build bridges 
in general, but only a particular bridge; we cannot cure patients 
as such, but only a particular patient. In dealing with the indi- 
vidual case, we must make use of such general laws as may be 
known to apply; our success in dealing with the individual case 
will be determined by how much is known by science regarding 
the laws governing this case. It is for this reason that bridges are 
built with greater success than has hitherto attended our efforts 
at curing patients by psychotherapy; it is not that patients are 
‘unique’ and ‘individual’, and bridges not—a bridge is as unique 
and individual as a patient—but because the laws governing the 
behaviour of bridges are very much better understood than the 
laws governing the behaviour of patients. Here indeed we have the 
best possible defence of current psychiatry against charges that its 
procedures are unscientific: as long as psychology fails to discover 

` general laws applicable to the psychiatrist’s practical problems, 
the, psychiatrist must use whatever methods common Sense, 
practical experience, intuition and theory dictate. He cannot tell 
his patients to wait until the glorious sun of science comes forth to 
illuminate the scene; he has to deal with them here-and now. As 
long as the psychiatrist does not claim to haye solved the scientific 
problems of personality, the psychologist certainly has no right to 
blame him for making use of any and every possible weapon he 


E 
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can lay his hands on, or for making up his own conceptual 
schemata. Nothing that could be called a science of personality 
exists at the moment, and psychiatrists can hardly be blamed for 
refusing to base their procedures on the mere hope of the futurc. 
Pragmatism is therefore currently the only method open to 
psychiatry and clinical psychology; it is perhaps unfortunate that 
50 many practitioners, instead of keeping an open mind in the 
very undeveloped state of the subject at the moment, have instead 
fallen prey to a *premature crystallization of spurious orthodoxies’. 

Another shibboleth, second only to that of ‘uniqueness’, is that 
of the ‘total personality’, In a sense, again, what those who use this 
phrase mean is so obviously true that no one would dissent from 
their stress on studying personality zn toto, as it were, and not hack 
it up into arbitrary, unconnected. parts. It would not be truc, 
however, to imagine that science holds a point of view opposed 
to such insistence. Even in physics, relativity theory lays great 
emphasis precisely on this ‘interconnectedness’ of all the phenom- 
ena which go to make up its field of study, and in psychology 
attacks on ‘atomists’ are largely attacks on men of straw, not on ` 
points of view held by any responsible psychologist. It is in the 
` negative connotations of this phrase, rather than in the positive 
ones, that one may discern certain specious arguments. ‘Thus 
believers in the ‘wholeness’ of personality often decry any type of 
analysis, and declare that analysis destroys what it seeks to study. 
Instead, they advocate methods and techniques which are s 
‘study personality as a whole’, 

It is difficult to attach any more me. 
to а claim to study ‘the universe 
tion of the methods actually used by those w 
phrases show anything very different from the ordin. 
used for many years—unless it be a lack of reliabili 
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them. (5) A general estimate of his intellectual level and the major 
qualitative features of his thinking. (6) A general estimate of the 
degree of security or anxiety, of balance in general, and specific 
unbalances. (7) The relative degree of maturity in the total per- 
sonality development.” The author is careful to point out that this 
is not all that the Rorschach can tell us. He goes on to say that 
‘this list does not represent a complete account of the personality 
aspects revealed by the Rorschach method’. Apparently the test is 
also ‘remarkably effective in estimating the intellectual status of 
an individual; in revealing the richness or poverty of his psychic 
experience; in making known his present mood; and in showing 
the extent of his intuitive ability as well as in disclosing special 
talents and aptitudes’. Furthermore, ‘it detects anxieties, phobias, 
and sex disturbances, as well аз more severe disorders, and serves 
as a guide for appropriate treatment’. 

According to this widely-accepted text, then, we are not deal- 
ing with the whole personality at all; we are dealing with creative 
and imaginative capacities, with intellectual level and status, with 
security, anxiety, balance and unbalance, with moods and 
intuitive abilities, with phobias, sex disorders, and other disturb- 
ances. It is not easy to see, then, how this treatment differs from 
the traditional analysis of personality into cognitive, conative, and 
affective areas, into abilities, temperanfent, character, intelligence, 
traits, and so forth. Obviously the total personality has to be 
analysed into smaller units before we Can study it at all; we are 
thrown back to the question of what is the best method of analysis, 
and how we can determine its validity. There is no book on per- 
sonality, and no test or technique, which deals with the unique, 
total personality; these words are merely used as camouflage to 
screen the absence of acceptable validity and reliability. 

What has been said here of the Rorschach test applies with 
equal force to the psychoanalytic technique. Here also we do not 
encounter a unique, total personality; we find Oedipus complexes 
and super egos, regressions and transferences, cathexes and 
libidos, fixations, symbolisms, compensations, catharses, Narcis- 
sisms, and many other strange entities jostling each other; all 
of them attempts to analyse what is elsewhere declared to be 
‘unanalysable’, e ол ¢ 

Surely this whole argument rests on a misunderstanding. What 
we are dealing with, in psychology as well as in physics, are certain 
entities, parts, ‘sub-wholes’, or whatever they may be valled, which . 
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stand in certain relations to cach other. It is vitally important to 
identify the correct parts or sub-wholes; it is equally vital to dis- 
cover the relations obtaining between them. It is absurd to call 
one of these tasks more important than the other; they are 
mutually interdependent, and progress in science is impossible 
without paying due regard to both. To say that orthodox psycho- 
logy is ‘atomistic’, and is interested only in the parts, to the neglect 
of the relations between them, is manifestly untrue; techniques of 
curvilinear regression, of curve-fitting, and of trend analysis show 
clearly the concern statistical psychologists feel about the exact 
form of the relation obtaining between the various entities dealt 
with. What is important is-not whether one party or the other to 
this dispute is right; the main consideration must be that each side 
should put its point of view in such a way that an empirical test, 
a crucial experiment, becomes possible. The question of the rela- ` 
tion between the variables fourd useful in psychology must be an 


лее problem; it cannot be solved on the philosophical 
evel. 


c ‘petites perceptions’, 
if the historical parallel be permitted), while the calculating 
machine cannot аз fruitfully deal with such large numbers of 
other in complex ways. 
ving, of intuitive inter- 
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subjective methods at least to suspend judgment until he can quote 
equally impressive evidence in his favour. ; 

First we may quote the conclusion arrived at by the Bureau 
of Naval Personnel, in their large-scale studies of prediction of 
success at training schools (Stuit, 1947). “The improvement in 
predicting school success by having in addition to test scores an 
interviewer's evaluation of experience, interest, and personality, 
is relatively small and may well be negative.' The actual figures 
quoted show that in many cases the addition of the interview to 
the straightforward prediction based on test scores alone lowers 
predictive accuracy; in no case is there a striking improvement. 
In view of the very large numbers involved, 37,862 trainees having 
been studied altogether, this conclusion and the data on which it is 
based deserve most careful study. 

Another study giving similar results, although in a different 
sphere, is the experiment on student selection. conducted by 
Himmelweit (1950, 1951). One of her aims was the comparison 
of existing methods of selection with objective test techniques. (A 
more detailed report of her. results with objective tests of person-, 
ality will be found in a later chapter.) One of the traditional 
techniques much valued by the University was the interview. ‘The 
main object of the interview is to assess the candidate's suitability 
to pursue a course of study, special consideration being given to the 
following factors: (а) general intelligence. (b) Previous education, 
training and experience. (c) Interests end motivation. (d) Person- 
ality and character.’ None of the tests of intelligence and tempera- 
ment which proved to be successful predictors of student success 
correlated significantly with the interview; indeed, there was a 
tendency for low negative correlation coefficients to predominate. 
When correlated with success at University the interview again 
failed to show a significant correlation; its predictive accuracy was 
indicated by a correlation coefficient of -067! In view of the rela- 
tively high correlations achieved by objective tests, we find again 
that belief in the integrating faculties of the human mind is not 
Supported by experimental findings. à 

Probably the most impressive, as well as the most conclusive, 
Study, however, is the experiment carried out by Kelly (1949) on 
the selection of clinical psychology students. Using à plethora of 
different methods—interviews, ratings, objective tests, projective 
tests, intelligence tests, sociometric techniques, interest and atti- 
tude schedules and questionnaires, and following tke students up 
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over several years, he concluded that ‘the most valid individual 
clinical judgments tend to be those based on relatively incomplete 
data regarding the assessee. In general, clinicians seem to make 
their best predictions on the basis of materials contained in the 
credential file and an objective test profile. Subsequent predictions 
based on increasing amounts of data, e.g. the addition of the auto- 
biography, the projective tests, the interview, and the situation 
tests seem to have resulted in successive decrements in validity, as 
cach additional type of data was added to the picture. This finding 
appears all the more significant when it is remembered that 
assessment staffs have uniformly been of the opinion that the inter- 
view contributed relatively most to their understanding of the case.’ 
While the report from which this summary is quoted is only a 
preliminary one, its conclusion shows a striking agreement with 
one of the most interesting results of the O.S.S. assessment pro- 
gramme (1948). Using in the main two selection stations for 
recruits, S and W, of which the former devoted roughly three 
times as much time to each recruit as did the latter, with all the 
increase in information, testing, interviewing, and conferences that 
that implies, the authors of the report found that ‘the job ratings 
given after the one-day assessment at Station W were generally 
more valid than those given after the three-day assessment at 
Station ©. The respective’ figures, giving correlations between 
. rating and criterion, are -37 and -53, using the criterion which the 
writers themselves regarded as the most valid. Тһе explanation 
that perhaps the candidates at these two stations differed in such 
a way as to account for the result will not hold; ‘it is not possible 
to say that the candidates who went to W were any easier to 
assess’. The writers conclude: ‘It would be profitable, in the long 
run, for us to assume that the additional information obtained 
by stretching the screening process from one to three days had 
diminished the validity of the final decision. 
е The evidence from all these different writers, with their vary- 
ing biases and divergent outlooks, appears to agree that the human 
brain is not a very good instrument for assessing, weighting, and 
combining many items of information in such a way as to make 
valid Predictions; on the contrary, it is easily confused when pre- 
sented with more than a very few such items, and added informa- 
tion tends to lower, rather than to raise, its forecasting efficiency. 
The evidence is perhaps not conclusive, but it is remarkably 
unanimous, : 


y 
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The history of science thus gives us guidance as to the method- 
ology to be employed in scientific work; does it also help us in 
deciding which of the many pressing problems to attack first? 
There appears to be one type of problem which is fundamental to 
all progress in the study of personality, and which must find at 
least a preliminary answer before we can hope to advance in any 
direction at all. This problem is that of taxonomy or classification, 
and it can best be introduced by using a simple example of how 
other problems tend to lead back to it. 

It is believed nearly unanimously by psychiatrists, psycho- ` 
analysts, clinical psychologists, counsellors, and the lay public 
that psychotherapy has the effect of alleviating, partly or wholly, 
the illnesses of the neurotic. A look at the evidence for this belief 
will show that it is a commornesense belief, not one which could be 
considered scientific. This does not mean that the belief is wrong; 
it merely means that no scientific evidence has been produced to 
justify it. All we have are statements of subjective feelings by 
psychotherapists regarding the outcome of their labours; it must 
be obvious that these statements cannot be taken as scientific 
evidence, in view of the long list of mistaken beliefs of this kind 
which history gives us. ab». 

It may be of interest to look at such evidence as there is, and to 


belief. In the only previous attempt to carry out such an evalua- 
tion that has come to hand, Landis 61938) has pointed out that 
*before any sort of measurement can be made, it is necessary to 
establish a base line and a common unit of measure. The only 
unit of measure available is the report made by the physician 
stating that the patient has recovered, is much improved, is 
improved or unimproved. This unit is probably as satisfactory as 
any type of human subjective judgment, partaking of both the 
good and bad points of such judgments." For a base line Landis 
suggests ‘that of expressing therapeutic results in terms of the 
number of patients recovered or improved per 100 cases admitted 
to the hospital’. As an alternative, he suggests ‘the statement of 
therapeutic outcome for some given group of patients during some 
stated interval of time’. . 
Landis realized quite clearly that in order tc evaluate the 
effectiveness of any form of therapy, data from a*control group of 
non-treated patients would be required in order to compare the 
effects of therapy with the spontaneous remission rate. In the _ 


+ 


discuss possible methods of investigating the truth of the general (| , 


26 SCIENCE AND PERSONALITY 


absence of anything better, he used the amelioration rate in state 
mental hospitals for patients diagnosed under the heading of 
‘neuroses’. As he points out, ‘there are several objections to the use 
of the consolidated amelioration rate... of the... state hos- 
pitals . . . as a base rate for spontaneous recovery. The fact that 
psychoneurotic cases are not usually committed to state hospitals 
unless in a very bad condition; the relatively small number of 
voluntary patients in the group; the fact that such patients do get 
some degree of psychotherapy especially in the reception hospitals; 
and the probably quite different economic, educational, and social 
status of the State Hospital group compared to the patients re- 
ported from each of the other hospitals—all argue against the 
acceptance of (this) figure... asa truly satisfactory base line, but 
in the absence of any other better figure this must serve.’ 
Actually the various figures quoted by Landis agree very well. 
The percentage of neurotic patients discharged annually as ге- 
Covered or improved from New York state hospitals is 70 (figure 


for the years 1925-34); for the United States as a whole it is 68 
(figure for the years 1926-33). 


Charged as recovered or impr 
66 for the United States 


consolidated amelioration 


» and very little if any psychotherapy, over 
те improved to a considerable extent. ‘Al- 
though this is not, stri a basic figure for “spon- 
still any therapeutic method must show an 
seriously considered.’ 


imi ired ‘base line’ is provided by 
Denker ( 1946). ‘Five hundred consecutive disability diis due to 


ic, anxiet 
but the greatest care 
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and all had been ill of a neurosis for at least three months before 
claims were submitted. They, therefore, could be fairly called 
"severe", since they had been totally disabled for at least three 
months’ period, and rendered unable to carry on with any 
occupation for remuneration or profit” for at least that time.’ 
These patients were regularly seen and treated by their own 
physicians with sedatives, tonics, suggestion, and reassurance, but 
in no case was any attempt made at anything but this most super- 
ficial type of ‘psychotherapy’, which has always been the stock in 
trade of the general practitioner. Repeated statements, every three 
months or so by their physicians, as well as independent investiga- 
tions by the insurance company, confirmed the fact that these 
people actually were not engaged in productive work during the 
period of their illness. During their disablement, these cases re- 
ceived disability benefits; as Denker points out, ‘it is appreciated 
that this fact of disability income may have actually prolonged 
the total period of disability and acted as a barrier to incentive for 
recovery. One would, therefore, not expect the therapeutic results 
in such a group of cases to be as favourable as in other groups, 
where the economic factor might act as an important spur in 
helping the sick patient adjust to his neurotic conflict and illness.’ 
The cases were all followed up for at least a five-year period, 
and often as long as ten years after the period of disability had 
begun. The criterion of ‘recovery’ used by Denker is characterized 
by the following data: (1) Return to work, and ability to carry on 
well in economic adjustments for at least a five-year period. 
(2) Complaint of no further or very slight difficulties. (3) Making 
of successful social adjustments. Using these criteria, which are 
very similar to those usually used by psychiatrists, Denker found 
that 45 per cent of the patients recovered after 1 year, another 
27 per cent after two years, making 72 per cent in all. Another 10, 
5, and 4 per cent recovered during the third, fourth, and fifth. 
years respectively making a total of go per cent recoveries after 
five years.! ' 5 
This sample contrasts in many ways with that used by Landis. 
The cases on which Denker reports were probably not as severe 
as those summarized by Landis; they were all voluntary, non- 
hospitalized patients, and from a much higher socio-economic 
stratum, (The majority of Denker’s patients were Clerical workers, 
‘1 These percentages are plotted in Figure 1 to illustrate the process of 


recovery graphically. t 
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executives, teachers, and professional men.) In spite of these differ- 
- ences, the recovery figures for the two samples are almost identical. 
The most suitable figure to choose from those given by Denker is 
probably that for the two-year recovery rate, as follow-up studies 
seldom go beyond two years, and the higher figures for threc-, 
four-, and five-year follow-ups would overestimate the efficiency 
of this ‘base line? procedure. Using therefore the two-year recovery 
figure of 72 per cent, we find that Denker's figure agrees exactly 
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Figure 2.— Percentage of recoveries of 500 neurotic patients 
not receiving Psychotherapy 
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We may now tu : 
Эн АЙН ү turn to the effects of psychotherapeutic treatment. 
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four headings: 
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(3) Slightly improved. (4) Not improved, died, discontinued 
treatment, etc. It was usually easy to reduce additional categories у 
given by some writers to these basic four; some writers only give 
two or three categories, and in those cases it was of course 
1mpossible to subdivide further, and the figures for combined cate- 
gories are given. A slight degree of subjectivity inevitably enters 
Into this procedure, but it is doubtful if this has caused much 
distortion. A somewhat greater degree of subjectivity is probably 
implied in the writer’s judgment as to which disorders and 
diagnoses should be considered to fall under the heading of 
neuroses’, Schizophrenic, manic-depressive, and paranoid states 
have been excluded; organ neuroses, psychopathic states, and 
character disturbances have been included. The number of cases 
where there was genuine doubt is probably too small to make 
much change in the final figures, regardless of how they are 
allocated,t Өмд 
Certain difficulties have arisen from the inability ofsome writers 
to make their column figures agree with their totals, or to calculate 
Percentages accurately. Again, I have exercised my judgment as, 
to which figures to accept. In certain cases, writers have given 
figures of cases where there was a recurrence of the disorder after 
Apparent cure or improvement, without indicating how many 
Patients were affected in these two groups respectively. І have sub- 
tracted all recurrencies of this kind from the ‘cured’ and "improved" 
totals, taking half from each. The total number of cases involved in 
all these adjustments is quite small. Another investigator making 
all decisions exactly in the opposite direction to mine would hardly 
alter the final percentage figures by more than one or two per cent. 
Ve may now turn to the figures as presented. Patients treated 


Y means of psychoanalysis improve to the extent of 44 per cent; 
such factors as ехсеѕ- 
th others included in 


(1942), Friess & 
2), Orbison 


(1925), Coon & Raymond (1940), Denker (1937), 
ane Their inclusion would not have a 
ne gree, although as Miles ef al. (1951) po 
223 ваа in terms of thoroughness, careful 
th Objectivity, there is often an inverse correla 
Ө Percentage of successful results reported." 
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patients treated eclectically improve to the extent of 64 per cent; 
. patients treated only custodially or by G.P.s, it will be remembered, © . 

improve to the extent of 72 per cent. There thus appears to be ; 
an inverse correlation between recovery and psychotherapy; the — 
more psychotherapy, the smaller the recovery rate! This conclu- — 
sion requires certain qualifications. "eas 

In our tabulation of psychoanalytic results, we have classed 
those who stopped treatment together with those not improved. 
This appears to be reasonable; a patient who fails to finish his 
treatment, and is not improved, is surely a therapeutic failure. The — 
Same rule has been followed with the data summarized under — 
eclectic’ treatment, except when the patient who did not finish” 
treatment was definitely classified as ‘improved’ by the therapist. 
However, in view of the peculiarities of Freudian procedures 
it may appear to some readers to be more just to class these 
a Separately, and deal only with the percentage of completed 
Teatments which are successful. Approximately one-third of the 
Psychoanalytic patients listed broke off treatment, 80 that, the per- 
entage of successful treatments of patients who finished their. 
Course must be put at approximately 66 per cent. It would appear, 
а that when we discount the risk the patient runs of M 
pes ment altogether, his chances of improvement under Wir. 
m Увш are almost equal to his chances of inge E s 
Ms ctc treatment, and slightly worse than his chances under ©©.:. 
Custodial treatment. Ё e ааг 
im Two further points require clarification. (1) Are patients in our 
ntrol groups (Landis & Denker) as seriously ill as those in our 
.XPcrimental’ groups? (2) Are standards of recovery less stringent. 


In 5 > x 5 А sa di to 
Our ‘control’ than in our ‘experimental groups? It is difficult 
of the great diver- 


к>? 


Answer these questions definitively, in view 2 
ds 88 Of рп between рус а From а close аш " 
€ literature T should say that the ‘control’ patients were probably 
at least as seriously ill as the ‘experimental’ patients, and p 
тэн 50. Аз regards standards of recovery, 1 should 2n Ж 
Ose in Denker’s study are as stringent as most of those use ру 
Psychoanalysts and eclectic psychiatrists, but that those en 
" tate Hospitals whose figures Landis quotes аге very proba У 
more lenient, In the absence of agreed standards of severity 0 
ness, or of extent of recovery, it is not possible US BO. Цанд? 2 
Th П general, certain conclusions are possible from ис ас 
еу fail to prove that psychotherapy, Freudian or otherwise, 
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facilitates the recovery of neurotic patients. They show that roughly 
two-thirds of a group of neurotic patients will recover or improve 
to a marked extent within about two years of the onset of their 
illness, whether they are treated by means of psychotherapy or not. 
This figure appears to be remarkably stable from one investiga- 
tion to another, regardless of type of patient treated, standard of 
recovery employed, or method of therapy used. From the point of 
view of the neurotic, these figures are encouraging; 
of view of the psychotherapist, they c 
favourable to his claims. 

In saying this, I do not me 
necessarily disprove the possi 


from the point 
an hardly be called very 


an to imply that the figures quoted 


bility of therapeutic effectiveness. 
There are obvious shortcomings in any actuarial comparison, and 


these shortcomings are particularly serious when there is so little 
agreement among psychiatrists relating even to the most funda- 
mental concepts and definitions. Definite proof would require 
a special investigation, carefully planned and methodologically 
more adequate than these ad hoc comparisons. But even the much 


rapy depends on faith, not 
his faith may be justified or 
however, it clearly cannot 


It may be asked what requi НЫ, 9 . 

satisfactory proof. The (pe жеу Mugen е 
(1) А valid and r 
(neurotic, psychoti 
state, depressive, 
course have to be 


quirements would a ; : 
eliable method of as appear to be 
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ethical objections; there is no systematic difficulty there that could 
not be overcome. It is conditions 1 and 2 which raise insuperable 
difficulties at present. Psychiatry has elaborated a system of classi- 
fication (or indeed a series of such systems) which appears to have 
little support in empirical studies, but is almost entirely based 
оп clinical insight and unsupported speculation. In many cases, 
different systems contradict each other; thus the Freudians assume 
that normality, neurosis, and psychosis form one continuum of 
regression, while orthodox psychiatry appears to support the view 
that neurotic and psychotic disorders are as it were qualitatively 
different diseases which are independent of cach other. Severai 
other views are held by various groups, and there are many facts 
In contradiction to any hypothesis so far advanced. This problem 
15 recognized by most psychiatrists, who put very little faith in 
diagnostic systems, and often shrugged off as being of little import- 
ance. Yet unless it is solved, it is difficult to see how such questions 
ES the one regarding the effects of psychotherapy can be answered. 
he Importance of taxonomy is often underrated by those inter- 
ested in the dynamics (so-called) of human behaviour, yet without . 
an adequate taxonomy progress is difficult if not impossible in 
these more nebulous and even less easily accessible regions.» 
, Even if there were agreement on the main psychiatric group- 
1185, yet the reliability of assigning patients to these classes would 
rw be regarded as sufficient by most psychologists. The work of 
48h (1949), Doering (1934), Elkin (1947), and many others ЫН 
own how much disagreement there is among psychiatrists eve 
When questions of major classification (neurotic, psychotic, nor mal, 
menta] defective) are at issue; anyone familiar with case Pus 
ences will know the frequency of almost total disagreement » 
respect to major classification between equally competent psycma- 
trists, Of particular interest in this connection are some re 
Teported by the Research Branch of the Information and E цг 
SR Discussion in the U.S. War Department, showing the ume 
in dj ible differences in diagnosis between PES A icu 
p ux establishments (Stouffer, 1950) (A de 
; Р pu a MEL Е] that psychiatric diagnosis 
15 of doub en дорага SLE we may mum with "s 
ini s є may have used his 
Scie 50 иса] psychologists, in the hope that h re n E 
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dispel such optimistic thoughts. Far from improving on the some- 
what arbitrary criteria used by the psychiatrist, the psychologist 
has by and large succeeded merely in claborating devices which 
will give rather low correlations with these psychiatric classifica- 
tions; in other words, he is content to take over the conceptualiza- 
tions of the psychiatrist, and tag behind in a fruitless endeavour to 
do rather less well what the psychiatrist could do without his aid! 
If this be indeed the function of psychology in its diagnostic aspect, 
then it is small wonder that so many psychologists have eschewed 
the Kraepelinian straight-jacket in order to indulge in Freudian 
manic spells which give them at least the illusion of usefulness. 
This example is intended to illustrate a very important general- _ 
‘ization, namely, that taxonomy, nosology, or classification lies ? 
at the very root of scientific progress, and that until taxonomic 
problems are solved in at least а preliminary way, scientific pro- 
gress towards answering more, ‚complex problems is barred. The 
history of science illustrates this again and again. Without the 
work of Ray and Linnzus biology could not have advanced as it 
did; Mendeleeff and his periodic table of the elements prepared 
the way for the fundamental advances in physics which culminated 
in the splitting ofthe atom. The importance of taxonomic concepts 
in the physical sciences is often neglected because they sometimes 
seem self-evident, and because their discovery frequently precedes 
recorded history; this hardly affects the argument, however. 
Measurement is essential to science, but before we can measure we: 
must know what it is we want to measure. Qualitative ог taxonomic 
discovery must precede quantitative measurement, 
Unfortunately, psychology. has tried to abort this stage of 
development; it has attempted to run before it Could walk. In its 
desire to be ‘dynamic’, it has forgotten that before it is ossfbla to 
study movement, one must know at least to а liniteq ЫН hat 
itis that moves. Where this was realized, taxonomic con т H e 
invented ad hoc to suit the argument, rather than allow, зра SE 
organically from the accumulated mass of facts. ЖЕДИ to grow 
and Eros, for example, so similar in essence to the anc; 2 Thanatos 
principles of yang and уй, can hardly be Said to р cient Chinese 
scientific observation. Rather, they appear Superim € the result of 
vationzat the dictate of philosophica] doctrine S esed on obser- 
of fact to fictibn has often appeared Üa scien, неп Subservience 
anatomy, for instance, taught that the body ©. Ancient Indian 
of PR bearing the ten fluid Products of tli "sess ten systems 


j - Ody to all portions 


trai ч 
_ traits and types, and of measuring them objectively. In par 
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{ш and fluid products were actually observed; ten was 
Such red to be the perfect number. Anatomists consequently 
ЫН t for, and believed they had found, structures which agreed 
ү Эв philosophers’ notions of perfection. This type of taxonomy 
E ardly be considered scientifically useful. 
К ле rest of this book, then, is concerned with the problem of 
omy in psychology; the question of classifying and isolating 
our interest | : Р ticular, 
— st has centred around the following questions. (1) It is 
Bud ns шс psychologists and psychiatrists to study neurotics 
лэ» otics because of the light these abnormal states are 
E onc to throw on the functioning of normal personality. The. 
hn ж made that in mental disorder we observe ordinary 
beliefed 5 aviour "writ large’, as it were; the disease process is 
enlarges 9 Tu asa kind of magnifying glass or microscope which 
ona E atis normally invisible to the naked eye. This assump- 
closely & e well founded, but it has certain implications which are 
B con m to problems of taxonomy. If the main types of 
fication of ysteria and psychasthenia) are the prototypes of a classi- 
Ung mai normal human beings into extraverts and introverts, аз 
phrenics ОРОН ог if the main psychotic division into schizo- 
uida Энн manic depressives should be used for grouping normal 
follow rA gin a into types, as Kretschmer believes—then it must 
different K neurosis and psychosis are not disorders qualitatively 
in the Клет Ше ordinary run of personality differences found 
Which inet mi population, but simply extremes ona continuum 
retschm udes everyone. T his assumption is made explicitly Бу 
would he t. and at least implicitly by Jung; many psychiatrists 
States; С ап opposite view at least with respect to the psychotic 
be b like Kallman, who believe they have isolated the 
а inte for the transmission of psychotic insanity, must 
to the mee old the qualitative view, of course, and thus be added 
ve Жап АЖ of orthodox psychiatrists. This question of quantita- 
саа аз ог qualitative difference 1s too important to be 
experim, у subjective impression or majority vote, however; and 
us with ent must decide which of these conflicting views presents 
Ч сс а more accurate model of reality. It will be shown later 
Р т experiments strongly support the quantitative continuum 
YPothesis, both in the case of neurotic and of psychotic disorders. 
iud If we have, then, a continuum from normal to neurotic, 
also from normal to psychotic, the question arises whether we 
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are dealing with two continua, as many psychiatrists would have 
us believe, or only with one continuum of ‘psycho-sexual regres- 
sion’, as Freud and his followers assert. As is well known, to Freud 
the neurotic has partly regressed from the adult psychosexual adjust- 
ment, while the psychotic has regressed to a considerably greater 
extent; thus we would on his hypothesis be dealing with a con- 
tinuum going from the normal, well adjusted person at the one end, 
through the partly regressed, poorly adjusted neurotic, to the com- 
pletely regressed psychotic at the other end. In terms of dimensional 
analysis, the question may be reworded in this form:—If we take — 
three groups (normals, neurotics, psychotics), can these be repre- 
sented as occupying one dimension only, or are two dimensions 
required? It will be shown later that the Freudian theory has to be 
rejected, and that two dimensions«are required to represent these 


(3) We have so far discussed ‘neurotics’ and ‘psychotics’ as if 
these were homogeneous groups. Yet it is well known that sub- 

: ajor groupings—hysterics, 
anxiety states, obsessionals, neurasthenics, and so forth within the 
» Manic depressives, schizophrenics, 


y ychotic complex. It is with 
respect to these sub-groupings that typologies such as the Jungian 


у Clearly our dimensional analysis must 
deal with the questions raised by these Sub-groups as ve as with 
„ 


problems of allocation of such groups as PSychopaths and epileptics; 
whose precise provenance is much in doubt among psychiatrists, 


the major factors of neurosis 


€ported i are 
relevant, and may serve as an introductory оцон chapters aro. 
ation. 


Experiments and statistical models cannot exist in vacuo: they 
EJ 


ta. For our Purposes, the obvious 


lendin ^ 
the kind required were those collected а to D 4 
se of psy 


logical tests. Choice of tests, therefore b 
> Оёсаше an im art 
portant p 


followi 3 

choice, 15 considerations will serve 
rejected certain types of test which ied 5, е reasons why we 
such as the so-called „Ргојесбуе њене Wide currency, 
centrated on tests which have been ques, and have con. 


much neglected lately, 
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namely those types of test which I have called *objective behaviour 
tests? (1950a). ф " 

It will be agreed by most psychologists that the subject-matter _ 
of a science of personality must be human behaviour. We may 
require intervening variables, hypothetical constructs, and theor- 
€tical concepts such as instincts, drives, reflexes, complexes, atti- 
tudes, or fixations in order to bring some kind of order into the 
Confused mass of facts which observation offers us, but nevertheless 
our starting point must be actually observed behaviour—provided 
We are careful to include all aspects of verbal, autonomic, and 
involuntary behaviour. Most psychologists would also agree; 
Perhaps, with Wolfle's (1949) ‘fundamental principle of personality 
measurement’: ‘An individual reveals his own personality through 
any change he makes upon any type of material.’ Rephrased, this 
might read: An individual reveals his own personality through any 
observable item of behaviour. This would equate ‘personality with 
stm-total of behaviour’, and with the reservation that some items 
of behaviour are more important than others—in the sense that ; 
they show à greater range of intercorrelation and of ‘belonging- " 
Ress’, and are more widely predictive than others—this definition 
would Probably not be objected to too vehemently. 

ersonality study therefore starts with behaviour, and attempts 

to find general laws which will explain*this behaviour; it must test 

these laws, and their predictive efficiency, against behaviour in 

Situations relevant to the generalization embodied in the law in 

question, Measurement ofa particular sector of behaviour is usually 

i ад ertaken by means of a test. Such tests are often classified as 

“ing éither ‘projective’ or ‘psychometric’, but this division is 

ardly adequate to denote a given test. There are a number of 

Principles of classification which together denote a given test with 
Sufficient accuracy. 


(1) Classification according to Type of Stimulus iE 
Test material may vary according to the degree of organization 
°F Structure present in it. In highly structured material, the correct 
response is uniquely determined by the relations existing between 
the different parts of the test; in less well structured material, the 
‘crimination of the parts, and their interpretation, often fcrm the 
Major task of the subject. This classification clearly does not imply 
5 dichotomy; there are different degrees of organization or struc- 


ture from the typical intelligence test at the one extreme, through 
* 
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the relatively well-structured Thematic Apperception Test, the 
less well-structured M.A.P.S. or Rorschach Tests, to the almost - 
entirely unstructured Stern Cloud-Pictures or the Buck Н-Т-Р 
test at the other. In general, we may зау that the more firmly 
Structured the test material or the test-situation are, the more 


restricted will be the variety of ways in which the subject can react 
to the test. 


(2) Classification according to Type of Reaction 


The two main ways in which the subject can react to the 
stimulus situation are the verbal and the non-verbal. These can them- 
selves again be classified, the verbal mode of reaction into written 
and spoken, and the non-verbal into autonomic and motor response: 


(3) Classification according to Mode of Response 


The two main modes of response are the creative and the selec- 
tive. In the creative type of response, the subject is required Ч 
Supply the solution to the problem quite unaided and entirely 
from within himself, while in the sclective type of response he 25 


merely asked to choose one of several predetermined solutions. 
While the selective response metho 


connection with intelligence tests, an 


1 Association, and the Sentence 
Completion. 


(4) Classification according to Method of Scori, 


Scoring may be objective or sub; 
when only one symbol, numerica] 


assigned t 
“is made to assign a symbol at all, but Syr or when no attemp 


: ЗУР ci M 9 а general impressionistic 
interpretation is given of the subject’s Performance. 


(5) Classificatios according to Interpretation of Score 


‚ The final product of the subject’, 
whether it bé verbal or non-verbal 
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classified as being either symbolic or non-symbolic. Tests of fluency, of | 
persistence, of suggestibility, of perseveration, or of level of aspira- 
tion give scores which are interpreted directly; others, such as the 
Frank Sexual Symbol test, the Dream Interpretation technique, 
ОГ most drawing, painting, and play tests and techniques make use 
of symbolic interpretation of the products. Both methods of inter- 
Pretation may of course be combined in dealing with the results of 
any one test. 


(6) Classification according to Mental Mechanism used 


Tests can be classified on the rather theoretical basis of the 
mental mechanisms believed to be employed in them. Thus, some 
tests are thought to call forth responses based on the principle of 
Projection (such аз the Thematic Apperception Test, for instance); 
others are based on the hypothesis that responses are determined 

У the Principle of identification (the Szondi test); others still make 
use of reaction-formation, regression, and various other Freudian 
mechanisms, Clearly, little agreement can be expected on alloca- 
Чоп of tests according to this principle, particularly as experi- 
mental evidence for the very existence of the mental mechanisms 
js Question is almost completely lacking. 

tis interesting to note in this connection that the one mental 

mechanism which ‘projective’ tests do not usually employ, in the 

opinion of experts in this field, is the mechanism of projection. 

argent (1945) ^in her review of ‘projective’ tests, makes а rather 

~ Singenuous defence of this confusing practice, as does Bell (1948) 

12 his book од Projective Techniques. Semantic oddities of this kind, 
Wever, have not prevented the widespread use of this term. 


A word should be said here about the term ‘test’. A test is defined by 
Warren (1934) as “а routine examination administered to individuals belonging 
to the same group in order to determine the relative position of a given indi- 
Уча in the group with respect to one or more mental traits, motor abilities, 
cte., or in order to compare one group with another in these characteristics". It 
2€ seen that, if we accept this definition, a projection test becomes a 223 
tradiction in terms, as the ‘projective hypothesis’ seems to deny the very exist- 
ence of these mental traits which are presupposed in the definition of the term, 
test’, By апа large, then, it would appear that the term ‘projection test’ has an 
Ч fortunate resemblance to the term ‘Holy Roman Empire’, which was not, 
Strictly speaking, an Empire, not entirely Roman, and very far from holy; 
Projection tests are not tests as the term is usually understood and they do not 
Sven claim to make use of the mechanisms of projection as defined by Freud and . 
is adherents. The term is a bad one and should be dropped fiom psychology. 
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(7) Classification according to Method of Administration 


Tests may be administered in the form of individual or group 
tests; it is doubtful if this differentiation is of any fundamental 
importance. It is included here for the sake of completeness. 

The seven principles of classification briefly discussed above 
are relatively independent of each other; all possible combinations 
of type of stimulus, ype of reaction, method of response, method 
of scoring, interpretation of score, and mental mechanism hypothe- 
sized may be found. Frequently the same test may be used in 
different ways by different investigators, changing its classification 
under some of these headings. 

Two techniques are used almost exclusively in current practice 
of personality testing. They are the questionnaire—well structured, 
with verbal reactions, selective response, objectively scored, and 
not symbolically interpreted and the so-called projective tests— 
little structured, with verbal reactions, creative response, subjec- 
tively scored, and often symbolically interpreted. It will be noted 
that both procedures duplicate psychiatric methods, and may 
under special circumstances be substituted for them. The question- 
naire duplicates the psychiatric interview; the ‘projective’ test 
duplicates the interpretation of dreams and symptoms by the 
psychiatrist. Both are essentially verbal in nature. 

hile in our research we have not neglected these procedures; 
we have laid most stress on what might be called ‘objective 
behaviour tests’. These tests are highly structured, call for non- 
verbal reactions—cither autonomic or motor, are objectively 
scored, and require no interpretation, This type of test has 
a peared to us to be in the tradition of behaviouristic psychology: 
t seems premature for clinical Psychologists to throw over what is 
most distinctive in their intellectual heritage, in order to don with 
unseemly haste the Emperor’s New Clothes, It is our belief—and 
Some support for it will be found in the Pages that follow. that 
objective behaviour tests tap fundamenta], Constitutional depths 
of personality organization which questionnaires 4 ү Е ыг 
_tests’ do not reach, while conversely these Psychiat = Pa ча 
are highly reflective of passing moods, sentiment e types of te 5 

forth. Аз we are concerned more with th 5 Icelings, and 5 
ту i я € underlying dimensions 

of personality than with the small ripples on th ying ? 

may perhaps be intelligible. © surface, our choice 

In one other respect will Cur work be fo 


und to deviate from 
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that of many others in this ficld. Students of personality have often 
used personality ratings, made by psychiatrists, teachers, super- 
visors, or laymen as an aid in the classification of human traits. 
Outstanding examples here are, for instance, Burt (1937), Cattell 
(1946), and Moore (1930). In our opinion the value of ratings 
cannot compare with that of tests; tests are measured items of 
Performance, while ratings imply judgments formed on the basis 
of personal impression of performances inaccurately perceived, 
Sampled under biased conditions, and evaluated according to 
unknown standards, Quite frequently, the very traits rated are 
selected according to criteria which would encounter severe criti- 
cism; Burt’s work, for example, is based on the rating of the strength 
ae McDougallian ‘instincts’ in children by teachers. The choice of 
2 System of classification some’ thirty years out of date, and dis- 
credited in contemporary psychology for many valid reasons, 
makes this work interesting more as a statistical exercise than as a 
Senuine contribution to the study of personality. Cattell's work is 
Rot subject to this criticism, and it must be admitted that he has 
Seen quite clearly that the value of syndromes and factors derived 
rom ratings is dependent on verification through the use of other 
methods, Nevertheless, until such verification is forthcoming—and 
it will be noted that ee of the results reported in this volume 
agree well with Сане analysis—conclusions based on ratings 
a ust at best be regarded as suggestive only, and as possibly telling 
8 More about the rater than about the ratee. 
an | € Might be retorted that in our own work ratings have played 
th Important part also. This is true, but it should be pointed out 
lat our use of ratings and diagnoses has been fundamentally 
M sns from that of the above-mentioned authors. We have br 
ngs as a kind of scaffolding, to aid in the construction of a soli 
edifice based on objective measures of behaviour; once this edifice 
15 Constructed the scaffolding may be taken down. It plays its part 
Баг as а transitional aid; it is not intended to replace the 
„shed building. Just how this method works is described in detail 
m the following chapter. 


к. 


Chapter Two 
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the taxonomic problem in personality research, then we are 

involved automatically in the problem of finding appropriate 
dimensions of personality. And for a method to aid in the solution we 
must turn to factor analysis, because in spite of the acknowledged 
difficulties and weaknesses of this method there does not exist, at 
the present stage of our knowledge, any other method which could 
aid us in our quest (Eysenck, 1949). This chapter will be devoted 
to a discussion of the relation between factor analysis and dimen- | 
sional research, as well as to an investigation of certain criticisms 
often made of the factorial method. It will also contain an element- 
ary introduction to the method of criterion analysis which has been 
introduced to overcome some of these criticisms, and to bring 
factor analysis into closer touch with the hypothetico-deductive 
method (Eysenck, 1950). 

All through this book, then, we are concerned with problems 
of psychological dimensions. These problems are absolutely funda- 
mental in science. If science depends on measurement, we must 
know what to measure. Thus we cannot make direct comparisons 
of magnitude between things which are qualitatively unlike 
which do not lie along one diniension. We Cannot form а dimen. 
sionally inhomogencous equation, such as 523 hours Жл ОЛ 
= 20 sacks of potatoes + 17 miles? without violatin farm 
sense as well as physical propriety: Yet Similarly q; dep 
5 ? f Y dimensionally 
inhomogeneous equations are formed constantly Y psycholocists 
and psychiatrists because of our fundamenta] lack a n, 5 
regarding ‘dimensional homogencity’, Поу ов 

а ject that choi i en 

vw кее may obje Oice of dimension; in psychology, 

1 The first pz-t of this chapter has on u 3 
plified level th ede that the pe of the book ni on Written at a very sim- 
without a background in factor apalai It should ie oe a readers 
already fully familiar with factoria E x33 and their Бат ded readers 


Ї our main task is to provide at least а provisional solution to 


à 
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E 


particularly if such choice is dependent on statistical procedures 5» 


Such as factor analysis, is arbitrary, thus setting psychology off 
from the remainder of science. This implies a profound misunder- 
standing of the procedures of physical science. In a very real sense, 
We may say that in physics ‘the choice of dimensions is arbitrary 
(Scott-Blair, 1950). ‘If we are moving towards or away from a 
Source of light or sound, the colour or pitch appears to change. It 
15 easy to calculate our speed from the ratio of change in wave- 
length to normal wave-length, which is, of course, a dimension- 
less number . , , we should be quite justified in defining velocity by 
this number instead of as a length divided by a time.’ “There 1s 
nothing absolute about dimensions . . . they may be anything con- 
sistent with a set of definitions which agree with the experimental 
facts’ (Bridgman, 1931). Р 
If, then, dimensions are in a sense arbitrary in physics, it seems 
unreasonable to expect factor analysis or any other statistical pro- 
cedure to give us psychological dimensions which are not up to a 
point arbitrary. All possible systems of dimensions to describe a 
given set of facts must be convertible into each other, as they all 
aM agree with the experimental facts; if two systems of dimen- 
s ns disagree an empirical test becomes possible to decide which 
ae two leads to deductions verifiable by experiment. уе: 
388 actor analysts disagree in their analysis of a given table 2 
Correlations, it is possible either (1) to convert one set о 
dE DR into the other through a set of intérmediate equations, thus 
А wing that these are merely alternative dimensional systems 
dually adequate to represent the facts, or else it 1s possible (2) to 
*20W that one solution is statistically unsound, or leads to deduc- 
rions which can be disproved. The argument between Spearman 
1927) and Thurstone (1935) was of the latter kind, leading to a 
-'SProof of Spearman’s original position; most arguments In the 
"'erature, however, are of the former kind. зан : 
4 9 acknowledge that one's choice of dimensions is arbitrary 
0—5 not mean, of course, that any set of dimensions may be chosen. 
П Practice, the restriction imposed by the requirement that dimen- 
Slons ‘may be anything consistent with a set of definitions which 
98766 with the experimental facts’ rules out ай but a very few alterna- 
tive sets of dimensions, and as facts accumulate choice becomes 
Very restricted indeed. Even to find one single set of dimensions to 
Embrace all the known facts of personality research may appear а 
tall order; to find several such sets would tax the imagination of 
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most psychologists unduly. The undoubted existence of scveral sets 
of dimensions, proposed by the various schools, is accountable in 
the main by the very simple procedure of unmercifully rejecting 
all facts not fitting into a given scheme. If psychologists followed 
the example of physicists and rejected any model which M 
clearly in opposition to experimentally verified facts, the groun 
would be cleared of much rubbish the main effect of which is to 
hinder the advance of psychological knowledge. | 
Science, ав we have seen, attempts to describe the хо 
world of experience through the formulation of abstract laws ant 
the creation of abstract categories. This process of abstraction 15 
absolutely fundamental to science; without abstraction there s 
be nothing but observation of particular occurrences. As Whitehea! 2 
(1929) puts it, ‘the paradox is now fully established that the шин 
abstractions are the true weapon with which to control our though 
of concrete fact. To be abstract is to transcend particular concrete 
occasions of actual happenings. The construction with which the 
scientist ends has the neatness and orderliness that is quite unlike 
the varied and multiform world of common sense, yet, since science 
grows out of and returns to the world of common sense, there must 
be a precise connection between the neat, trim, tidy, exact world, 


which is the goal of science, and the untidy fragmentary world 0 
common sense.’ й 


There is of course an abundanc 
—instincts, (Edipus and other com 
reflexes, maternal overprotection, 
ergs and metanergs, abilities and p 
and acuities, colour blindness, heredi 


€ of abstractions in psycho losy 
Plexes, reflexes and conditione 


that they are abstractions, but rather on the fact that they fail to 
show the ‘precise connection’ with the observed happenings; that 
they fail to account for all the observations а that they 
fail to forecast accurately future observations ae t made. 
Similarly, ‘factors’ and other Statistical Conceptions sh 3 d not be 
criticized for being abstract concepts; valid шон оша п 27 
only from a demonstration that they fail to show m can c a 
nection between observation and hypothesis. Que ын precise cc 

the ‘actual eXistence’ of factors, ог Criticisms cre lons regarding 
‘statistical artefacts’, show a complete mj erstanding 28 Бе хо 


PX ace БЭЛ musund, 
of сопсеріѕ іп science; a physicist Would regard as mean; less 
eaningle 
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questions regarding the ‘real existence’ of the ether, or criticisms - 
of the concept of the electron as being a ‘statistical artefact’. 
Concepts are useful in science in so far as they help in introducing 
order into a confused field; they cease to be useful when they fail 
to account for all the relevant phenomena, or when they cease 
to give rise to accurate predictions regarding new and hitherto 
unobserved phenomena.* 
Let us start our discussion of factor analysis with a simple and 
straightforward example, in which we shall as always pay attention 
to the scientific logic of the method rather than its mathematical 
foundations. Let us assume, as Spearman did in his revolutionary 
article in 1904, that underlying all cognitive tasks there is one 
general ability, which we may сай ‘g’ or "intelligence". Let us also 
assume that different people possess this ability to varying degree, 
к. that different tasks сай for this ability іп varying degree. Let 
5 make one further assumption, namely, that there 15 no other 
ability required in order to succeed in these tasks except ‘g’ and 
Something which is specific to cach particular task—a specific 
ability which we may сай 5, for task 1, fe for task 2, and so on to Sn 
for task л. 1 
Now if by some magic we could obtain ап ‹ 
m "intelligence" ofa random sample of people; it would be possible 
9 correlate this measure—which we may denote bya capital G 
with each of our tests. Let us suppose that we have four Цол 
НЫ С, and D—-and that the correlatións of these tests with ‘G 
tir 8, -7, and -6 respectively. It can be shown that, Bu Жо 
prod Ptions, the correlation between any two tests is Буве уч 
oo Uct of their respective correlations with ©”. In other wo 7 
sts A and B will correlate :9 X ‘8 = 72 tests C and D wt 


accurate measure of 


1 « 
Ein © may in this connection draw att 
and Sp, БР in constructing his theory of relativity. In the fi 
УГ Бан Theory of Relativity, Einstein based his physic 
lished ps on Minkovsky's and Riemann's gcometrical pre 
n including their electro 


soem (Xs Xy Xa, 
hysical momenta; 
cist would take 


uring the year 1933 Einstein adopted the five parameter Sy. 

Ф Xj) because the previous system failed to explain certain p! 
ны requiring the introduction of a new parameter. No physi 

riously the question: “Have Xs; X, etc. any real existence?’ 
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correlate 7 x -6 = -42, and so forth. This may best be shown in 
the form of a table (Table I): 


TABLE I 
Correlations | Test A | Tet B Test C Test D 
7 wih'G' | 9 В 7 б 
~ "| EIL е = 
9 (-81) "72 “54 
8 | 72 (64) 56 48 
7 3 `56 (49) 42 
6 54 48 42 (-36) 


The two-digit values in the central part of the table form what 
is known as a matrix, that is to say a rectangular array of figures. 
It will be seen that the diagonal values arc put in brackets; this 15 
done because it is impossible to correlate a test with itself. We can 
of course correlate parallel fozms of a test, or repeat the adminis- 
tration of a test, or use split halves. None of these methods, how- 
ever, would give us values dependent only on ‘g’; they would all be 
influenced by the specific ability required by the test in question, 
by memory, practice, and all sorts of other factors. Consequently 
they are estimated by analogy with the other values in the matrix, 
. ie, they are formed Бу multiplying the “С? correlation of a test 
= with itself, i.e. by squaring it. In this мау, :9 x -д = -81 for test Ay 
and so forth. These values are not, as are the other values in the 
table, subject to empiricàl check. They form part of the gener 
hypothesis we are investigating. ў 
Let us look at this table a little more closely. All the values 
in column D will be-seen to be Proportional to {позе in column 0977 
58/:54 = 56/48 = 49/42 = 42/36 — 1-167, Similarly, any | 
other pair of columns is Proportional. This fact can be put in many 


e different ways. Let us PE the intercorrelations of two tests—say 
A and D—with two other tests—say B and G 4 
i д ns 
are: . These correlatio 
72 63 
“48 42 


8: 
b. 
By the proportionality rule, -72/-48 = Gay, 3 Y 
expressed in the form: 72 x -42 =. 48 A und rara E 
with (-72 x 32) — (48 x :63) = 0. This last ex ue is identical — | 
аз the tetrad criterion; it expresses the єв ession is known 
given set of tests the assumptions mage above ace мн if I 
rne out in 
+ 
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fact, then any tetrad (set of four correlations) formed in this 
manner must come to zero within the limits of the sampling 
error. 

We can now dispense with the idea that by some magic we had 
managed to obtain an accurate measure of the ‘intelligence’ of the 
persons in our sample, a measure which enabled us to calculate 
the correlations of “С? with tests A, В, С, and D. We start out, as 
in fact we always must do, with the matrix of intercorrelations, as 
actually observed on a sample of the population. We also start out 
with an hypothesis—the hypothesis that these intercorrelations are 
due to one factor (“2”) only, and that there is nothing in common 
between any two tests that cannot be accounted for in full by this 
factor. We can then deduce, in the manner shown above, that if 
Our hypothesis is indeed correct, then it would follow that the 
Observed intercorrelations should show a certain pattern of pro- 
Portionality—in other words, that all the possible tetrads should : 
Come to zero. Having verified this deduction, we may wish to 
know how closely each of our four tests correlates with this 
hypothetical factor ‘g’. This can be calculated very easily in a 
number of ways. Quite obviously, if we knew the diagonal values, 

We could immediately calculate the required correlations by taking 
m each cast the square root of the diagonal value; М8: = 39 


64. = -8, and so forth. Unfortunately, the diagonal values are" à 


not given to us. But we may deduce them from a knowledge of the 

proportional arrangement of the whole table. We can form a 

ieee in which one of the values is the sought-for diagonal ie 

. ich as it is unknown we may denote by an X. We then ge 

equations of the lind: 

: (X x 48) — (72 X 54 =° ! 

from which we can then calculate that X = 87, and that conse- . 
quently the correlation between ‘G’ and A is equal to 9. This 
correlation is customarily referred to a5 the ‘saturation of test А 
with factor ‘g’, or more simply test A's factor saturation. The factor 
Saturations of the other tests can be calculated in a similar manner. 
So far, we have been dealing with a purely hypothetical 
€xample. To lend an air of verisimilitude to the proceedings, let us 
Ook at the actual correlations observed between four “tests of 
neuroticism (Table II). Each of these ‘tests’ is a questionnaire, 
Carefully designed to measure a different area of this general field- 
est A measures ра по-ошачс complaints, test B measures child- 


э; 


- 


^ 
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hood symptoms, test C measures acceptance of soldier role, and 
test D measures sociability. Each of these tests had been shown — 
previously to be a scale or a quasi-scale in Guttman's (1947) sense 
of the term; the sample consisted of some four thousand normal 
and neurotic soldiers. Here are the correlations, together with 
the diagonal values and the ‘saturations’ of each test with the 
hypothetical factor of ‘neuroticism’. 


TABLE II 
Correlation with | TetA | Test B TetC | Test D 
Neuroticism | 83 | *67 *56 | +46 
| | 
| Ч Т 
:83 | 69) :55 50 38 
:67 55 45) 36 34 
'56 50 ‘36 (32) 25 
. 46 38 ЕС 25 (21) 


As these correlations are observed correlations, they are of 
course subject to sampling errors, and the products of the factor 
saturations will not reproduce the observed correlations exactly: 
A matrix showing the products of the factor saturations is called 2 
product matrix, and can be compared with the original matrix 0 
observed correlations, to show how closely the hypothetical factor ‘ 
accounts for the actual test intercorrel 


: ations. Below (Table III) 8 — 
given the product matrix for comparison with Table IT. 


4 


TABLE ш ^ 
| 83 | 67 56 46 эр 
8 | (49-| 46 | 2-2 
"03 "99 *5 46 p 
67 | 56 (-45) En ae 
'56 46 38 (32) 2 
-46 -38 "31 26 C21) 


It will be seen that observed and p AM | ». 
agree fairly well. Where the observed Кел нш en a 
thetical one is -56, and so forth; the average АВАН 55, the less 
of sign and omitting the diagonal values, is m 2 rega E. | 
conclade that our hypothesis gives д fairly pk 102. Е E. 
‘untidy, fragmentary world of common $ nae Picture o 

If we subtract the (hypothetical) ‘pro pem КИРИ. 

? 


Y These figures are taken from Stouffer of al. (1 ) 
* (1950). 
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Shown in Table III from the observed correlations, as shown in 
Table II, we get what are known as ‘residual’ correlations, i.e. 
correlations between the tests after the influence of the hypo- 
thetical factor has been removed. These residual correlations, in 
the present case, arc all insignificant and due only to sampling and 
9ther chance errors, but that is not always so, and it will be clear 
that in principle it is possible to analyse a matrix of residual 
Correlations in exactly the same way as it was possible to analyse 
the original matrix of observed correlations. Computations are 
actually performed in a rather different manner, but this does not 
invalidate the point made. 97 
It is often said that while the argument from the original 
hypotheses to the necessity of finding proportionality and Zero 
tetrad differences is rigorous, the finding of such zero tetrad differ- 
ences can by no means be taken as a proof that the hypotheses are 
Justified. That of course is true; no hypothesis can ever be proved to 
€ right. All we can do is to make deductions from our hypotheses 
aud attempt to show that within the limits of sampling errors and 
errors of observation these deductions are verified. If deductions 
are not Verified, the hypothesis is disproved; if deductions аг 
Verified, {һе hypothesis is confirmed. There is, however, no ed, 
assuring that if further deductions are made, these will also n 
Confirmed; in that sense proof is forever impossible. Newton's 
YPothesis continued to be confirmed for three hundred years іп 
the wide variety of deductions made хот it; yet finally some 
eductions were shown to be contrary to observed fact (motion of 
ercury; bending of light rays). Here, as inso many other reped 
, ctorial theory should not be singled out for criticism eee 
Justified, applies to all scientific methodology. The hypothesis of а 
Single factor can be put forward, deductions can be made from it, 
and these deductions can be verified or refuted. If the deduction 
of tetrad differences amounting to zero 1s verified, this does not 
Prove the correctness of the hypothesis, but it does provide striking 


Support for it. 

i € may now turn to two questions which arc of the greatest 

1прог(апсе in factorial analysis, and which we are m a position to 
1888 even on the basis of the very simple statistical model we 
ауе constructed so far. The first of these questions 15: "Having 
Ound a factor in this manner, how can we interpret it? The 

Second question is: ‘If our interpretation of a factor involves, as 1t 

Often does, an external criterion, why factor analyse at all—why 
5.8.Р. М 
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not work through multiple regression and other orthodox pro- 
cedures in order to get maximum correlation between tests and 
criterion?" 

Let us take an example. In the matrix of correlations below 
(Table IV) let A stand for a conventional test of intelligence, and 
B, C, and D for ratings made by three teachers of the intellectual 
ability of a group of children known to them, all of whom have 
been given the test. From these intercorrelations we can show that 
the hypothesis of a single factor as being responsible for all the 
observed values is tenable, and we can proceed to calculate the 
factor saturations of test and teachers’ ratings alike. These 
are shown at the top and at the left side of the matrix of inter- 
correlations. 


TABLE? IV 
| A | B | C | D 
| 5 |. | 4 = 
9 (38) | :63 54 35 
8 :63 (49) 42 35 
1 54 42 (36) 30 
i 45 :85 “30 (25) 


If we correlate the average of our three teachers’ ratings with 
our test, we find a value of 71. We might therefore say that as far 
as orthodox statistics carries us we end up with a moderate correla- 
tion between our test and what we may regard provisionally as 
our criterion. We cannot tell why this correlation is not higher— 
possibly the test is at fault, possibly the criterion is imperfect 

The factor analysis shows us that teachers? ratings and test are 
measuring one and the same trait; it also shows us that there is 
nothing in common to the three teachers’ ratings that is notaio 
in common to the test. In other words, test and ratings alike are 
giving an estimate of the same underlying, hypothetical t es 
intelligence. That being so, we can see that the té de з rai А 
to the extent indicated by its factor saturation (-9) тар 
teachers’ ratings are rather less successful (7, 6 52) 3 уша p е 
we average the teachers' ratings, the correlation of th. 5). Even i 
rating with the factor would only amount to 510. F е combined 
therefore, haz enabled us to supersede the origin al bed analysis, 
show that our test is a better measure of in, riterion and to 


Lem x telli 
criterion we started out with. We may now di E oe 
€ origina 
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criterion and rely exclusively on our test, as giving us a better 
estimate of a child's intelligence than do the ratings. (In practice, 
of course, many more tests, teachers, and children would have to 
be used before such a conclusion could be accepted. We are only 
interested here in the logic of the argument, not in its. practical 
aspects.) 

Another method of looking at the question of the criterion 
suggests itself. In Table II we gave the intercorrelations of four 
tests of neuroticism, and calculated their saturations for this factor. 
It is possible to analyse the differences in score on each of these 
four tests for a normal and a neurotic group, and to compare the 
success with which each test accomplishes the task of segregation 
with its factor saturation. If the factor we have isolated is really 
identified correctly as one of neuroticism, then we should expect 
the ability of a test to discriminate between normals and neurotics 
to be proportional to its saturation for the factor; the test with the 
highest factor saturation should give the best discrimination, the 
test with the second highest saturation the second best discrimina- 
lion, and so forth. Table V shows percentage of normals and 
neurotics having ‘neurotic’ scores on each of the tests, difference 
between the two percentages, biserial correlation of each test with 
the ‘normal versus neurotic’ dichotomy, and the factor saturations 


previously given. „© 
TABLE V 
- нэ БУ? 3 | E == 
7 : : Biserial Factor 
Normals | Newatics f Direna [Correlations Saturations 
< 2 4 | ? 
% | % 
Psychosomatic complaints 20 | 89 бо -66 83 
Childhood symptoms 20 | 53 33 -38 -67 
Acceptance of soldier role 31 | 59 28 35 +56 
33 46 


Sociability | б | 45 29 


It will be seen that the column giving the factor saturations is 
directly proportional to the column giving the biserial correlations 
of cach test with the dichotomy ‘normal versus neurotic’, as 
required by our hypothesis. As this column of correlations between 
tests and criterion will play a considerable part in our argument 
later, we may perhaps give it a special name and call it the 
‘Criterion Column’. At the moment, let us merely note the fact 
that this proportionality is unlikely to have arisen by chance; that 
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itis in line with our prediction based on a certain interpretation 
of our factor; and that our faith in the accuracy and correctness of 
our interpretation is therefore considerably strengthened. We have 
not proved that our hypothesis is correct; we have merely shown 


that a relatively large number of facts can without distortion be 
subsumed under one consistent set of concepts. 


In doing so, however, we have made an assumption which is 
not supported by the data so far given. We have assumed that 
‘neuroticism’ is a trait which forms a continuum from the ‘normal’ 
to the ‘neurotic’ end, and that our groups of normals and neurotics 
are merely random samples chosen from different points of this 
continuum. We may present this assumption diagrammatically, 
as in Figure 3, letting the ordinate represent the hypothetical 


+ 
NORMAL 


A NEUROTIC 
Figure 3.—Hypothetical ‘Neuroticism’ Continum 


continuum, and plotting the number of people falling at cach 
point along the abscissa. We may assume a normal distribution 
for the sake of the argument; in actual fact the p 


ee om à : articular form of 
distribution is not of any importance. Points near the plus юм s 
o. 


our hypothetical continuum represent well integrated emoti 
ally stable, non-neurotic personalities; points towards és motion- 
end of the hypothetical continuum represent poorly i € minus 
emotionally unstable, neurotic personalities. At Point A aerated, 
drawn a line to indicate that pcople located to the ri e have 
point are particularly liable in our Society to { us of this 
psychiatrists, to be labelled ‘neurotic’, and to € treat m erred to 
of a variety of medical and psychological methods Ch by means 
will ensure that some people to the right o this ре ance errors 
fact be so diagnosed and treated, although st. ic nt will not in 
ought to be, While some people to the left of poi E “Peaking they 
those in its close proximity, will be diagnoseq ыг Particularly 
though they deserve this label less than others wh €urotics, а]. 
9 manage to 
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avoid it. But by and large such chance errors will only attenuate 
the accuracy of this division into two classes; they will not 
invalidate it. . 
This hypothesis should be contrasted with another hypothesis 
Which is not contradicted by any of the data we have so far con- 
sidered. It may be maintained, and in fact it is often maintained 
both by popular opinion and by some psychiatrists, that ‘neurotics’ 
are qualitatively different from ‘normals’. This view would recognize 
no intervening stages; a person is either a neurotic, or he is not. 
In some cases this clear qualitative difference is obscured, to 
all appearances, by other factors overlaying this fundamental 
dichotomy, but when these are cleared away a person is found to 
fall quite definitely into one or the other of these two categories. 
Such categorical or qualitative thinking has always characterized 
carly stages of scientific progress; usually it is shown later that 
What was considered qualitatively different at опе time is really 
reducible to quantitative variation along some kind of continuum. 
hat 18 required is an experimentum crucis, і.е. an experiment 1n 
Which deductions are made from the two contrasting hypotheses 
Which are mutually contradictory, so that one set of deductions 
Сап definitely be shown to Ье false. Again, this does not prove that 
the other hypothesis is correct; it merely adduces evidence in its 
Support. : 
On the basis of the qualitative hypothesis, we would assume 
t with respect to whatever it was that differentiated normals 
from neurotics, both populations would be homogeneous; in other 
Words, there would be no gradations of ‘normality’ or ‘neuroti- 
cism’, This in turn would imply that tests which differentiated 
*tween normals and neurotics would not show any particular 
Pattern of intercorrelations within the normal group only, or 
Within the neurotic group only. On the average, these inter- 
correlations should amount to zero. hs. 
On the basis of the quantitative hypothesis, our prediction 
would be quite different. We may approach this matter in the 
following way. Let us consider two tests, A and B, both of which 
are known to discriminate significantly between normals and 
neurotics. Now let us take our group of normal subjects only. If 
Our quantitative hypothesis is correct, it follows that this normal 
group may be subdivided into two parts at some arbitrary point X. 
Ose to the left of X are relatively more stable, better integrated 
People than those to the right of X. It would follow that they 


tha 
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should do better on both tests A and B than those on the right. 
But if there is a tendency for some people to do well on both tests, 
and for others to do poorly, then quite clearly these two tests 
would be found to be correlated. On the basis of our quantitative 
hypothesis, therefore, we would expect А and B to be positively 
correlated. Generalizing this to n tests, we may say that any group 
of tests which discriminate between normals and neurotics should 
show positive intercorrelations when only the normal group is 
considered. The same considerations, of course, apply to the 
neurotic group. This group also may arbitrarily be divided into 
two at an arbitrary point Y, and again it can be shown that on the 


basis of our quantitative hypothesis positive intercorrelations 
would be expected. 


We go one ste 


(within either the normal or the neurotic group) be positive; they 


Ability of the respective tests to dis- 
nd neurotics. If a test discriminates 


group and factor analyse ‘it, then our factor saturations shoul di be 
proportional to the correlations of the tests with the normal ver. 
neurotic criterion. As a corollary, it follows that the factor s ‘sus 
tions derived from the two matrices (the norm the atur ar 
should be proportional to each other. neurotic) 
An example may make this argument clearer, 
have given the intercorrelations and the factor 
tests of neuroticism, for a mixed normal and neurotic M 
Below are given the matrices of intercorrelationg for Population. 
neurotic subjects separately. Results of a factor anal Ormal and 
given in each case. YSIS are also’ 
In Table VIII are given the factor saturations fop 
for the normal and the neurotic groups Separate} € four tests 
Criterion Column (i.e. the column of biseria] Correlati Y and the 
tests and the dichotomy normal versus neurotic), їе 2005 between 
to the casual observer that the three columns sho © obvious 
proportionality. 


al and 


In Table II we 
Analysis of four 


distinct 


амаг "ч ей 
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TABLE VI 
- NORMAL GROUP 
| 80 “52 50 | +28 
n | | 
o (64 | м 46 | 9 
'52 4t | (27 -20 16 
50 | 46 | 9 | (55) | аз 
:28 | 19 16 35 | (08) 
TABLE VII 
NEUROTIG GROUP 
| -65 | -68 42 42 
$5 | (4а) | “44 28 3 
8 | 44 (49) “30 31 
“42 28 30, | (18) 18 
42 | -28 3I 18 (18) 


TABLE VIII ` 
ileri Factor Saturations: | Factor Saturations: 
кеш Nerul ‘Group Neurotic Group 
'*66 -80 ji -65 
`38 +52 -68 
:35 | 50 9 420 
"33 | -28 42 


E гаш, of course, no conclusions should be based on 
tum rom four tests, and on simple casual observation of degree 
mudo Уралын We may therefore broaden the scope of our 
нэ Ple and present data from a rather larger sample of tests. 
ing Ко ople was originally taken from an investigation employ- 
КО Cen tests altogether (Stouffer, 1950). We have calculated, 
3 Асана given in the original report, factor saturations 
E © normal group and factor saturations for the neurotic 
vnb" and compared these with the criterion column. The full 
Po are set out in Table IX. Again, it will be seen that the 
sa T Saturations are proportional to each other, and .to the 
the 23 Column. This proportionality can best be expressed in 
ibit; orm of a correlation coefficient. The two columns of factor 
rations correlate together -82; the Criterion Colunin correlates 
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with the factor saturations for the normal group :88, and with the 
factor saturations for the neurotic group :81. All three values are 
highly significant, and are clearly different from the zero values to 
be expected on the basis of the qualitative hypothesis. We may 
therefore conclude that our data lend Support to the quantitative 
hypothesis, and refute the qualitative hypothesis. 


TABLE IX 
| Син Factor Factor 
Title of Scale nin Saturations: Saturations: 
amen | Normal Group | Neurotic Group 
1. Psychosomatic complaints -66 69 (-15) -56 (-16) 
2. Childhood neurotic symptoms -38 49 (-09) -58 (—-01) 
3. Personal adjustment ( —) “42 67 (-09) 68 (-05 
4. Over-sensitivity 33 -48 (45) -56 (-50) 
5. Childhood fears . 33 42 (16) -52 (-02) 
6. Acceptance of soldier role (-) -35 58 (—-15) -48 (—-29) 
л Worrying 27 | 59602) | -56 (-o9) 
ED 33 | 533 (03) | 556 (-08) 
9. Participation їп sports (—) 28 28 (14) ‘34 (— 26) 
10. Identification with war (— ло 4о (— 02) NEC 
11. Childhood fighting behaviour (-) “8 Зо (—:32) лу (550) 
12. Childhood school adjustment (—) “rr "15 (09) 119 (006) 
13. Relations with parents “12 чо (-o6) (ос) 
14. Emancipation from parents (—) “08 220: 6:94) | «18 (—-24) 
15. Mobility "09 —'04 (-31) 05 (-24) 


: 
Figures in brackets are referred t 
first reading. Tests whose scores ha 
coefficients in the Criterion Colu: 


9 on page 58 ang sk 
ve been multiplied by 
mn positive are 


‘ould be omitted at 
— Tin ord ake 
marked thus: 130 га. 


So far, we have been concerned with patter : b 
lations which show the particular hierarchical i шт». 
features which are summed up in saying that ne m 
ences vanish. As it happens, however, matrices hs iR d di A 
particular pattern are quite rare, and in the majority B EU. zx 
hypothesis that only one factor is responsible fop the ae а 
tions сап be shown to be false. When that happen; ae correla 
into rather more difficult territory, and be in to dea Move over 
factor analysis, i.e. we attempt to account foy Our ob, BEEN 
tions in terms of several independent factors, €d correla- 

Let us again begin with an example, Be ow (Tabl 5 

iven the intercorrelations of four tests (questions 29le X) аге 


3 +. ODnajr ‘ 
with-lack of personal adjustment, oversensitivity. fay 5) dealing 
emanci- 
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pation from parents, and mobility. No factor analysis was carried 
out, and no diagonal values are therefore given. 


TABLE X 
A B с р 
А — 49 264 "04 
B 49 — 05 “12 
С “II “05 = —-28 
D E ча —-28 — 


It will be seen that the negative correlation between tests C and 
D Completely upsets any attempt to account for the correlations in 
this table in terms of one factor common to all the tests. Except 
for it, all the other tests correlate together positively, showing that 
they all measure something in common— presumably the same 
factor of ‘neuroticism’ we found before. Then how can we account 
for the presence of this negative correlation? А 
, Perhaps the nature of the tests will give us a clue. ‘Emancipa- 
Поп from parents’ is defined by the authors of the test as involving 
both actual physical emancipation involved in leaving home, and 
the psychic emancipation implied by the establishment of satis- 
factory heterosexual relationships. ‘Mobility’ denotes unstable em- 
Ployment record and much geographical moving around. Both 
the ‘mobile’ and.the ‘non-emancipated’ tend to be more neurotic; 
yet the correlation between the two tendencies is negative. Clearly 
another factor is at work over and above ‘neuroticism’, and it may 
be surmised that there are two kinds of neurotic: the type that is 
caught up in mother’s apron strings, is shy, sensitive, worrying, 
dysthymic’, anxious, and depressed; and the more hysterical type 
that has no difficulty in breaking emotional bonds (perhaps be- 
cause they are never very firmly established?), that has never 
Settled down, has little persistence and seldom sticks long at any- 
thing once the initial enthusiasm has worn off. In other words, 
we may here be dealing with a manifestation of the introvert- 
extravert dichotomy as popularized by Jung (1923), which 
stemmed from just such a clinical differentiation, made originally 
by Janet (1909), of the main neurotic syndrome. | - 

This hypothesis was suggested by our small matrix of inter- 
Correlations; clearly it requires some proof before it can be taken 
Very seriously. If it were correct, what other tests would we expect 
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to correlate with this hypothetical introvert-extravert continuum? 
The introverted, non-emancipated child would presumably not 
have done as much childhood fighting as the extravert; he would 
not have participated in sports to the same extent; he would have 
greater difficulties in assuming the soldier role. The extravert 
might be expected to show, in addition to greater mobility, a 
tendency to have less close relations with his parents, and to be 
easily offended and resentful of criticism (grouped together by the 
authors of the test of 'oversensitivity"— perhaps à somewhat mis- 
leading term). 

Let us return to Table IX, We 
of ‘neuroticism’ from the intercor: 
and neurotic groups separately, 


the normal, the other from the 
that 


> 18 to sa тос 
and positive saturations. This is an inevitable Вау ри ы Me 
of extraction to which we shall return again. here let 5 ПОС 
that very roughly our anticipation is borne 7 уша nn 2 
‘emancipation’ is at the Opposite end of the scale To oom pe 
mobility’, and 
that roughly at least the other tests are grouped as рс e y lee 
second, bipolar, factor is not likely to be т ЭХ lcted. Thi 
menon, because it appears in a Very similar form in s a 
independent factor analyses—the one for the normal, the T € Ч 
the neurotic group; the correlation between the bio Bt other for 
This example is not presented аз proof of the eR is 91. 
introversion-extraversion factor; quite clearly there ire Ce of an 
steps missing in the argument, and too much Subject: 
reasoning, to allow us to reach any such far-reachin Scie d 
What we can conclude, however, without much ^ пов 
to our argument, is that more than one factor ies Racking 
account for the observed correlations. Does this re Quire, е io 
in any way the methodological considerations disc Lt affect 


i e 
nection with the case of the ‘single factor’ hypothesis in con- 
ately the breakdown of the simple model necessitate. Pes Ortun- 


IX, in brackets. It wi бсей 
that the new factors (one from ckets. It will be noti 
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complex rethinking of the logic underlying factor analysis, and to 
this we must next turn. Before doing so, however, we must intro- 
duce a simplified, diagrammatic method of presenting the results 
of factorial analyses which will be indispensable for our argument 
If we wish to avoid complex mathematical expressions. 

Let us go back to Table IX, in which were given two factors 
extracted from each of the two matrices of intercorrelations 
Observed, i.e. that derived from the normal and that derived from 
the neurotic population. Consider only the values obtained from 
the normals. Casual observation shows, and it can be proved by 
actual calculation, that the two factors we extracted from this 
matrix are independent of each other, i.e. they are not correlated. 
The fact that a test has a high saturation on one factor does not 
enable us in the slightest to predict whether its saturation on the 
other factor will be high, low, or negative. This notion of inde- 
pendence can be shown to correspond to the concept of ortho- 
Sonality (being at right angles to each other) in geometry. We 
may therefore plot our two factors at right angles to each other, as ^ 
in Figure 4. Each line represents a factor, the horizontal line the 
first factor, the vertical line the second factor. Both are equal in 
length, and degrees of saturation are marked off on each, going 
from + гоо through -oo to — 1-00. A circle is drawn around this 
structure to delimit what is called the “two-factor space’. It is of 
Course easy to add other factors to this structure; thus a third 
factor might be added, at right angles Чо the other two, which 
would stick out from the intersection of factors one and two (the 
origin) at right angles to the plane of the paper. Factors beyond 
three cannot be represented visually, but can easily be treated 
mathematically according to the rules of n-dimensional geometry; 
factors are then considered to lie in hyper-space. — і 

So far we have only drawn the framework within which our 
Observed factors can be plotted. Test 1 has saturations of 69 and 
"15 on the two factors; consequently we plot it in Figure 4 in the 
Corresponding position. Tests 4 (with saturations «шыла ад, 

(with saturations of :58 and — 15), and 14 (with saturations 
of 20 and — -34) have also been plotted in the diagram. The 
Other tests are not plotted because they would obscure certain 
Important relations which are vital to an understanding of the 
Implications of this graphic method of representation. 

Let us connect the points marking п. Зэнрэн of tests 4, and 14, 
to the origin by means of straight lines. It can be shown by simple 
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trigonometric calculation that if we multiply the cosine of the 
angle separating these two lines (angle «) by the product of the 
lengths of these two lines, we obtain the correlation between the 


Gr 


Ó ч 


2.8 
SECOND FACTOR 


Figure 4 


two tests in question. It can be seen from the qi 
ia 


angle u is about go degrees. The cosine of an an Ёгаш that the 
is of course Zero, so that the total product must | € of go degrees 
(The actual correlation in fact is :02.) 


© close to zero. 
Similar lines can be drawn between the origin E 
Nd the ро 

points 
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representing tests 1 and 6. In each case, the correlation between 
any two tests is represented by the product of the length of the 
lines connecting them with the origin, multiplied by the cosine of 
the angle between the two lines. 

The lines which we drew between the origin and the points 
representing tests 4 and 14 are technically called vectors. (A vector 
is a line having a given length and direction.) It will be seen that 


EXTRAVERSION 


NEUROTICISM 
FACTOR 


Figure 5 


each test is represented by a vector, and that these vectors form a 
System which gives a diagrammatic representation of the relations 
obtaining between the tests, just as the matrix of intercorrelations 
Save an arithmetical representation of these relations. In either 
case we are dealing with a pattern which we are seeking to describe 
in the simplest possible way. e 4 

Let us next plot in two factors space all 15 tests, using the 
neurotic sample values from Table IX. Figure 5 shows the 
results. The reader may care to plot the values from the normal 
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sample himself for comparison; they are very similar to, though not 
exactly identical with, those shown in Figure 5. . 
~ Tt will be noted that the upper and the lower halves of this 
diagram are rather nicely balanced. There are more points in the 
upper half, but those in the lower half have higher saturations on 
the second factor. In fact, it would appear as if the first factor axis 
passed exactly through the centre of the whole swarm of points. 
‘This is indeed the case; the method of extraction of a factor 
ensures that it will as it were be a multidimensional average (a 
‘centroid’) of all the vectors involved. This feature of factor 
analysis is at the same time a strength and a weakness; just as an 
average may be meaningful or meaningless, dependent upon 
precisely what is being averaged, so the factors extracted from. 
a matrix may be meaningful or not dependent upon what pre- 
cisely is being intercorrelated. Factor analysis is often considered 
as a kind of sausage machine—correlations are put in and out 
comes the psychologically meaningful and scientifically valuable 
result. This view is not shared by any responsible investigator; like 
all other statistical methods, factor analysis is far from fool-proof, 
and where psychological insight, careful planning, and an under- 
standing of the hypothetico-deductive nature of scientific methodo- 
logy are lacking, no mathematical brilliance can suffice to salvage 
the results of misapplied ingenuity. To admit that factor ana] i 
has occasionally been misapplied, and that results reported b 
some of its overenthusiastic users have often been, absurd dogs ; 
amount to an admission that critics are justified in facie : ыг 
examples in condemnation of factor analysis as а Scientific A : 
almost any scientific tool can be misused by the unini fated ethod; 
less of its general value and importance. Criticism im р, oue. 
not on occasional absurdities, but on the best available е based, 
of the methods properly used and correctly interpreted examples 
to its own rules. Any other evaluation is merely semanti according 
boxing. € shadow- 
The admission that our factors are nothin t ау, р 
very damaging one; indeed, criticism of factor analys, 2888 isa 
not merely based on misunderstanding Centres Jay e Which is 
point. The ‘g’ we extract from a matrix obeying ies Y on this 
tetrad.criterion is not an average in this sense; we Ser rules of the 
tract tests from our battery at will, and as lon Si add or sub. 
tetrad criterion, we can be sure that we are alwa obeys the 
same, identical *g. But the moment that We are нил the 
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matrix which does not obey the tetrad criterion, and which requires 
therefore more than one factor, we leave objectivity dangerously 
far behind, and are forced to enter into many circuitous byways т 
before emerging once more into the main stream of scientific 
methodology. M 

A factor, in order to be useful in ordering our descriptions of 
the multiform facts of psychology, must have two characteristics. 
It must be unique, and it must be invariant. By unique we mean that 
from a given matrix one and only one solution should be possible 
if the rules of extraction are followed. Ву invariant we mean that 
however much the battery of tests may be altered by adding other 
tests, or subtracting tests already included in it, nevertheless the 
factor saturations of those tests which are retained in the battery 
should not change. It may be said that rules can be laid down 
Which ensure the uniqueness of the factors extracted by any of the 
current methods; it is with respect to invariance that our troubles 
start. 

Let us look again at Figure 5. Factor т is balanced carefully 
between the positive and the negative sides of the bipolar second 
factor, in such a way that the factor saturations of the tests having 
positive saturations on the second factors balance out almost 
exactly the factor saturations of the tests having negative satura- 
tions. What would happen if we omitted test 11 from our calcula- 
tions? The relations between all the other tests (as indicated by 
their positions in,the two-factor space) Would remain quite un- 
affected, because they represent the correlations of these tests, 
which are of course not changed in any way by dropping any 
particular test. But the position of our two axes would be changed. 

he first one would pass through the new average (centroid) of 
our cluster of fourteen tests, and would therefore lie approximately 
as shown in Figure 5 as position A. The second factor axis, in order 
to remain at right angles to the first factor axis, would of course 
shift through a similar angle to position A’. Take out another test 
HOW, say test 6, and again the position of the axes will shift, this 
time to B and pr, Take out tests 9 and 14 too, and now our axes 
Would assume positions С and C'. Clearly, these axes are in no 
way invariant, but shift about depending entirely on the make up 
Of the battery of tests. In other words, they are almost useless in 
their present form for our purpose. X 
ne way out of this difficulty has been suggested by Thurstone 
(1935), and has given worthwhile results in the field of mental 
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abilities. He lays down certain mathematical rules which, in a 
suitable matrix, will give a rotation which is at once unique and 
invariant. This final position he calls ‘simple structure’, and he 
defines it by saying that in simple structure as many of the axes as 
possible should be at right angles to as many of the original test 
vectors as possible. Аз we have seen, ‘being at right angles to’ 
means simply ‘having zero correlations with’, and this demand 
therefore means that as many of the tests should have zero satura- 
tions for as many of the factors as possible. More particularly, he 
requires that every factor axis should be at right angles to at least 
as many test vectors as there are factors, and th 
be at right angles to at least one factor. In practi E 

amount to this: (1) Each test should have A eine e 
tion. (2) Each factor should have at least as many zero saturations 


as there are factors. (3) There should be at least as many XO or 
OX entries in each pair of factors as there ат 


: ге factors. СО or 
OX entry means simply a zero satur: (An X 


i ation in one factor accom- 
panied. by a non-zero saturation їп another.) 


Experience has shown that these requirements are met by 
matrices of observed correlations t 


00 rarely to be of much signifi- 
оше long ааш orthogonal pattern of factor axes is sba 
Thurstone therefore permits his factors to be correlated Siih each 


other, so that his factor axes are no lon i 
3 a ger at right 
other. Figure 6 shows the kind of pattern which is fons 20 


found in a Thurstone-type analysis. I : 

factors as determined from the Орц а a the centroid 
rotated factors showing ‘simple structure’ and а с n 5 € he 
by the cosine of the angle «. Thus once the factor, DIT elation given 
been found and rotated, we c Sina matrix have 


ап write another 
: m ; er i soter 
correlations giving the correlations betwee: жир of inter- 

: this in turn 


can be analysed, and in the case of tests ES ы йа 
the tetrad criterion, thus giving rise to one oni Seems to obey 
factor which corresponds very closely to Spearm, “order or super- 
seem to be able to recover the impressive sim үе 5 ‘g’. We thus 
ness of Spearman's original picture, without сүй and orderli- 
account the additional ‘group factors’ which caving out of 
model inapplicable. made his simple 
The advantage of this method of analysis which ү: 
of all, unique and invariant primary factors, кү gives us, first 
on the intercorrelations of these primary factor then a 56 based 
‘g will not change with addition or omission of s obvious, Our 
$565, but will 


at every test should 
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remain uniquely and invariantly defined. This would not be so if 
we were to extract ‘g’ from our original battery of tests, without 
first fitting them into a ‘simple structure’ pattern. Such at least 
would be the claim of those who follow Thurstone in his method. 
To a considerable degree these claims appear to be justified. 


i 


Figure 6 


Experimental results do appear to efforts in patterns of inter- 
correlations which allow the research worker to use the methods of 
‘simple structure’, and factors thus discovered do seem to correlate 
In a hierarchical pattern which obeys the tetrad criterion. It may · 
be objected that angles of rotation are not really uniquely deter- 
mined, in view of the difficulties caused by sampling errors, and 
that the invariance of the resulting factors has not been demon- 
8.8.Р. Е 
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strated experimentally under stringent conditions of univariate 
and multivariate selection. These objections do not attack the 
principle which Thurstone is attempting to establish, and there is 
good prospect that in time the mathematical problems may be 
solved, and the experimental evidence be supplied. At the very 
least, we would appear to have here a method which holds out а 
hope that through its use we may be able to solve the taxonomic 
difficulties in the cognitive field: 
Let us look at the general picture given of the organization of 
mental abilities by factor analysis. We discern four main levels. At 
the lowest level we are dealing with single. observations of be- 
haviour which may be regarded as falling into the general domain 
with which we are concerned. Many thousands of such observa- 
tions are made by all of us every day, with varying degrees’ of 
objectivity, reliability, and validity. We attain a much higher level 
when we can measure the consistency of this behaviour through 
reliability coefficients or other statistical means. This ei usually 
be possible only with test or examination behaviour, although 
teachers’ or other persons’ ratings, social achievement or repeated 
success at some intellectual task might be considered Hike р this 
level we would be dealing with mental tests of known reliabilit 
dealing with a very restricted universe of content—a test Ёл 1 2 
in simple addition might be quoted as an example, or a em Ё 
ability in completing letter series of a given level of diffi ra p 
third level is reached whén we find that these tests in tur: icu S | 
to produce primary mental abilities, such as the verba] n correla Л 
memory, or visuospatial factors. The fourth and highe | ia 
is reached when these primary mental abilities in ty st level of = 
to correlate and give rise to a second-order factor cor rn are foun 
‘g or general mental ability or intelligence. A diagra responding to 
sentation of these levels is given in Figure ve TREN repre- 
enables us to give fairly exact estimates of the actor analysis 
each of these levels to the total variance of a sive contribution of 
generally to give a reasonably accurate picture of th test, and quite 
of all the thousands of separate facts represented е interrelations 
simple diagram as the one presented here, €ven in such a 
What we are doing in factor analysis is simpl 
of the total variance (the total variability on Ч cee breaking up | 
sample of people tested; the variance is equal t €st shown by the | 
standard deviation) into various component 9 the square of the 
let the total variance equal 100 per cent, or ee Thus we may 
mty in order to keep 
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to the order of size of correlation coefficients, and write an 
equation to account for all of the variance of test A: 


суа ==? + as* 4-аз? d s e au? ag deam ү 


In other words, the total variance of test A (04°) is accounted 
for by the squares of the saturations of this test on factors 1, 9, 3; 
to л, plus the squares of the specific factor saturation (s) plus the 
error variance (62). The various parts of this equation have 
separate names. The variance contributed by the factors from 1 to 
nis known as the “communality’, because they constitute what the 
test has in common with other tests; it is usually written 42. Specific 
and error variance together are known as the ‘uniqueness’ of the 
test, because these factors are not shared with any other test. The 
uniqueness (и?) is of course equal to 1 — 4?, because и? and Л? add 


up to 1. The reliability (тш) of a test (its empirical test reliability) 


is given by the formula r,a = 1 — a,?, from which we can deduce 


that if the reliability is known, the error variance can be calculated 
by the formula 4? = 1 — т. These various relations allow us to 
calculate the contribution of the various factors—communality, 
specificity, error—to the total variance of any test in the battery. 
These various relations hold for orthogonal factors; with certain 
changes they can be alterod to fit the oblique pattern also. 

Can this method of multiple factor analysis with rotation to 
‘simple structure’ be taken over into the field of temperament, 
character, attitudes, interests, and so forth? There are à number 
of reasons why such a transfer would be of doubtful value. In thc 
first place, the factor analysts who worked in the field sement 
abilities had concepts and tests available which ha 
gone a long period of development. The faculty 
the one hand, and the early test designers on th 
lined the general field with considerable ingenuity, Everun 
tutored common sense might be expected to give clues to the main 
areas in which primary mental abilities could be found: in the 
field of test material—verbal, numerical, perceptual; in de field o£ 
method of approach —speed, accuracy, and so forth, i Picus t 
this relatively happy state of affairs is not far {о Seek; thr ХЭ rx 
ages philosophers have been concerned with c Ognitive КОЗЫ 
with epistenzological problems, and the genera] dee ed 
possibility of knowledge. Their speculations inthe. eq oa T s 


sich за ва pruning, but they did иар cie reae 


d already under- 
Psychologists on 
€ other, had out- 


` 
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Sides, and created a background of sophistication against which 
more recent workers could proceed. 

Not so in the non-cognitive field. This has always been the 
Step-child of philosophy, psychology, and medicine, and the in- 
Vestigator will not find ready to hand a large number of objective 
tests which he may use, or a set of reasonable hypotheses which he 
may investigate, Instead, he will find a subject still apparently 
enjoying its birth trauma without giving any signs of growing up, 
but filled instead with return-to-the-womb phantasies; a subject 
im Which the most remarkable theories are bandied about with 
careless abandon, and in which not even the notion that some sort 
ОЁ proof is required before hypotheses can be accepted is widely 

nown or shared. Instead of objective tests, he will find the field 
dominated by ratings (in spite of their known unreliability), 
questionnaires (in spite of their known lack of validity), and 
So-called projective techniques which multiply in a horrifying 
Malthusian fashion before his very eyes (in spite of the almost 
Complete absence of proof that they measure anything whatever).' 

nder these circumstances he cannot begin—as could the student 
4) ability—by taking a number of tests and intercorrelating them 
їп an effort to bring order out of chaos; the very tests which he 
may wish to use are not yet in existence, and indeed, it is not at all 
Clear Which mental functions he should attempt to measure even if 
зеге ready to construct new tests. 4 

In the second place, even if the intrepid investigator were to 
succeed in locating a few objective (or even not-so-objective) tests 
Which in his opinion measured temperament, he would find their 
intercorrelations woefully low, and obstinately determined not to 

а into a pattern even remotely resembling simple structure. His 
troubles would be increased by the fact that the notion of the 
Positive manifold? (that all saturations should be zero or positive) 
makes sense in the field of abilities, where negative correlations are 

ardly ever found, but does not apply in the field of affection and 
“nation, And while the notion of the ‘positive manifold’ is not 
absolutely essential to ‘simple structure’, there is no experience to 
gnide the research worker to find simple structure in the absence 
ol à positive manifold. 1 : à 

, In the third place, there are certain practical difficulties which 
Mhilitate against wholesale testing in the field of temperament. It 


1 Gresham’s law would appear to apply in the field of psychological tests го 
less than in that of economic theory! 
` 
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is easy to devise group tests for mental abilities, and to test many 
thousands of school children, students, and adults with them; to 
repeat these tests, or alternate versions of them, in order to obtain 
reliability coefficients; and to carry out all these procedures in the 
class-room, the factory, or any convenient large hall. Not so for the 
type of test which might be used for the measurement of tempera- 
ment. Group testing is nearly always impossible (except for the 
“egregious and not very useful questionnaire); special apparatus is 
required, such as the P.G.R., the Luria, or the C.F.C. machine; 
each test takes a great deal of time—sometimes up to two or three 
hours—and can only be carried out in the laboratory, or the 
hospital, with consequent loss of time to the subject who has to 
journey thither on many separate occasions. Add to all this the 
fact that intelligence testing is ‘respectable’, and encounters little 
opposition nowadays, while tests of temperament are almost 
by definition emotion-rousing, and frequently frightening or un- 
pleasant, as well as relating to thought content not considered 
‘nice’ by. many people, and it will be appreciated why methods 
which can be used in the cognitive field may not be easily trans- 
ferred to the non-cognitive field. 
Do we have an alternative to the method suggested by Thur- 
stone in our attempt to find a set of factors which corresponds to a 
set of psychologically real influences? Cattell (1946) has pointed 
out that there are two ways of arriving at such an objective, The 
investigator, he writes, ‘thay (1) devise possible wa 
‘determining the analysis of the given correlation m 
only one set of true factors will emerge, or (2) start Fon ae 
opposite shore and propound, on psychological grounds Done 
hypothesis about what source traits are operative in the variable d 
Then he will see if these factors correspond to any of the оос fle 
mathematical factors found in the matrix.’ The fist dum e 
methods is that adopted by Thurstone. The & Sond Н | езе 
methods, which is clearly the hypothetico-deductiye meth d these 
has shown itself to be so fruitful in science generally, ig p od which 
Cattell on two grounds. ©... in the first place, Deor Seed by 
has so few other reliable avenues for arriving at, or even s ity study 
the basis source traits, that hypotheses are likely to be uspecting, 
the second place, the mathematical solutions to аан aoe In 
tions are so numerous and varied that unless the hy MAR _ Correla- 
stated in very precise е Ld the "rogi "i ae сп be 
. Cattell re 1 iem 5 еаѕу— 

so easy as to be worthless peats his Position in shy 


YS of over- 
atrix so that 
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different terms: ©... we have rejected one of the two major 
approaches normally approved by scientific method—namely, that 
of inventing a hypothesis about the particular factors expected and 
attempting to discover a factorization to match it—because in this 
field almost any hypothesis could be so “confirmed”. Instead, we 
seek general guiding principles for the mathematical analysis itself 
which will lead to a unique solution.’ 

While I appreciate the great contribution which Thurstone has 
made to the analysis of mental abilities, and while Т am fully con- 
scious of the difficulties Cattell has pointed out in arriving at a 
hypothetico-deductive method of analysis which shall be capable 
of being refuted (i.e. one where the ‘proof’ would not be so easy 
88 to be worthless), I nevertheless believe that progress in the non- 
Cognitive field depends on the discovery of such a method. Accord- 
Ingly we turn next to an attempt to define the principles underlying 
Such a combination of factor analysis and hypothetico-deductive 
method; the general name given to this procedure is that of 

Criterion Analysis! (Eysenck, 1950). The rest of this chapter will 

€ devoted to an exposition of this method; the rest of this book 
0 an application of this method to a variety of theoretical and 
Practical problems. i 

For a clue to the procedure which might be adopted here, we 

may return to an example given earlier in this chapter. It will 

€ remembered that several (questionnaire) tests of ‘neuroticism? 
Were given to groups of normal and neurotic soldiers, the intercor- 
relations for the normals and the neurotics separately calculated, 
and factor analyses on the two matrices carried out. As shown in 
Table IX, two factors emerged from each of the matrices which 
Were very provisionally identified as ‘neuroticism’ and as ‘extra- 
Version-introversion’. Let us concentrate on the first factor only 
for the time being; the arguments which apply to it will be shown 
to apply to subsequent factors also. Let us deal only with the factor 
emerging from the intercorrelations matrix of the neurotic group 
as plotted in diagram form on page бї 3 this is so similar to the 
ал group factor pattern that what is said of the one is also 
applica is 5 7 

An ea э ЖЫ we scem to be involved in а contra- 
diction, We have ‘identified’ the first factor as pink with 
‘neuroticism’; yet we have used the diagram on which we have 
plotted this factor (Figure 5) to show that the factor axis has no 
fixed position, but changes with the addition or subtraction of 
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tests in the matrix. If the factor found is meaningless (as is implied 
in the notion that it fails to be invariant), how can it be interpreted 
at all? The answer lies in the fact, emphasized before, that a factor 
is an average; an average may be meaningful or meaningless 
according to what it is that is being averaged. If we throw together 
tests of intelligence, neuroticism, and introversion, the resulting 
average intercorrelation will be a meaningless jumble impossible 
to interpret. If we throw together a lot of tests of neuroticism, the 
resulting average will have at least a rough-and-ready resemblance 
to meaningfulness, because what is being averaged is all measuring 
much the same underlying dimension. While therefore the axis is 
not permanently fixed, and is certainly not invariant, it will point 
roughly in the right direction—with an error of ame 10 zi 20 
degrees either way. Consequently, the interpretation of such a 
factor, unrotated as it is, is possible, although one's confid * 
this interpretation is purely subjective. pw 
TABLE XI 
List of Tests 


Maudsley Medical Inventor i 
е У--40 item n ici i ir 

Score RE of Teo answered ‘No’ 08755527 06 ааа 

аг] aptation—U.S. Ма Radi 3 
23 КООП ОГх аль уу ium Plaque Adaptometer. Score 
Non-Suggestibility—body-sway t. ili i i 
oe y д y test. Ability to resist suggestion to sway 
Motor Control—absence of static ataxia; 
Goal Discrepancy Score —smallness of 
O'Connor tweezers test. — 
Judgment Discrepancy Scores—smallness of 
O’Connor tweezers test. 


Es as preliminary test to С. 
evel of aspiration Scores on 


а HU О Ww р 


judgment discrepancies on 


G. Index of Flexibility—number of shifts in aspirati 

tweezers test, irrespective of size or direction. ration scores оп O'Connor 
Н. Manual Dexterity—best score of nine trials оп t 
I. Personal Tempo—speed of writing 2, 3, 4, Эрс test, 

15 seconds each. У for two trials of 
J. Fluency—number of round things and of things to е 7 s 

30-second periods. ^! mentioned during 
K. Speed Test (1)—speed of tracing when instructed to b 

accurate. (Choice conditions.) € both quick and 
L. Speed Test (2)—speed of tracing prescribed path о 

instruction to be quick. n track tracer under 
M. Persistence Test I—length of time during which leg 1 

able and fatiguing position. 518 held in uncomfort- 
N.  Persister-e Test B—holding breath as long as possib 

ог oxhaling. ible, without inhalin 
O. Stress T ees of 5 to recover previous Scoring rat E 

t t ter ial st iod. ае ў 
Ё уре of test after special stress period. Оп pursuitmeter 


Non-Perseveration—extremes of perseveration 57 
very low, аге scored low, while scores пеагег the xod cither very high or 
авс are scor : 
ed high. 
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However, it will be remembered that we did go a little further 
than this and provided a criterion to indicate that this interpreta- 
üon was at least partly justified. We calculated the 'criterion 
column’, i.e. the column of correlations of each test with the 
normal-neurotic dichotomy, and showed that our factor satura- 
tions correlated to the extent of -81 with this ‘criterion column’. 
This is objective evidence in favour of our hypothesis, and repre- 
sents a genuine application of the hypothetico-deductive method, 
because we had shown previously that on the basis of our main 
hypothesis regarding the existence of a neuroticism continuum, and 
on the basis of subsidiary hypotheses regarding the differential 
Success of the various tests used in measuring this continuum, such 
а correlation could be predicted. This still leaves our factor rather 

wobbly’ and far from invariant, but it does suggest a further step 
Which will tie it down completely to a position which is both 
unique and invariant. That position is one in which the correlation 
.ctween the column of factor saturations and the criterion column 
15 а maximum, In other words, the factor axis is to be rotated in 
Meha way that factor saturations correlate as highly as possible 
With the criterion column. 
ex З: Us give an example of this procedure by reference to an 
te 5 B cm using objective tests rather than questionnaires. The 
e suo rated in Table XI were given to 93 normals and 105 
test ^R Subjects, and the correlations calculated between each 
сс and the normal. neurotic dichotomy: These correlations are 
уеп in Table XII under the heading C, (Criterion Column— 
Suroticism), Product moment intercorrelations were worked out 
“tween the tests for 64 of the normal subjects who had carried out 
all of the tests, and the matrix of intercorrelations factor апа1ув:4. 
Wo factors were extracted, and are given in Table XII under the 
cading F, and-F,; the heading A? indicates in each case what 
Portion of the total variance of each test is accounted for by these 
‘Wo factors, Rotation of the two factor axes through an angle of 
five degrees makes the correlation between first factor saturations 
and criterion column a maximum (r = 1574), Е us the 
Values for the rotated first factor given in column D. А graphic 
5 is ої in Figure 8; axes I and II are 
€monstration of the results is given 1n £18 1 
i . D is the rotated first factor axis. To 
€ centroid axes as found; D is t dé 
Sho cn if an attempt were made to approach 
“Ow what would happ : DENN io: they Sill be 
Simple structure, axes I’ and II have 3 y 1 


Seen to be almost completely uninterpretable. 
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TABLE XII 
ын z x 
Dom. ® b зэ kt Bal D 
= | | 
А 23 143 | "PII +065 о8о 127 
B 27 392 | —'220 | -202 256 407 
с. 51 620 | —-416 *557 409 650 
D 54 644 | —-438 -б07 425 | 675 
Е o6 100 -089 018 059 | 094 
F 10 497 —'455 | 454 333 -52 
G 05 275 | --397 233 191 303 
н 57 405 | —-078 170 258 410 
I 30 438 175 | 222 267 424 
1 03 300 | -018 -090 4188 200 
K 27 '523 "100 284 | әд | +515 
L 17 *565 461 4532 | 4388 | 552 ) 
М | 46 | %6у | 384 | +516 -363 576 
Хо 29 632 430 | 584 377 599 
О 24 294 == 4 7097 +189 +300 
Р 21 207 241 “IOI эд 189 
з — іе 
203 | 093 | -296 


Explanation of Column Headings: 

Cy = Criterion Column, 
dichotomy. 
F, and Е, = First and second unrotated fa 
tions of normal group only. 

h? = Communality. 
F, rotated into maximum correlation with criterion column 
D with vector extended to unity 


Le. correlation of each test with normal-neurotic 


ctors from analysis of intercorrcla- 


j= 
(LED 
Po. f 15187 
Several points should be noted in this example, The angle 
required to get maximum correlation between F, A ЧС. is very 
small; as all the tests were chosen as measures of neuro icis 1 еб 
average (the first centroid factor) is therefore meaningful P kon 
to the optimum position. This optimum Position is unique and 
invariant; the addition of new tests, or the Subtraction of sti Ён 
would not alter the position of D. The relatively hi d one | 
correJation:between C, апа F, 18 in line with our h E a posu | 
the factor 45 one of neuroticism and Supports НЬ 16518 ОК 
neuroticism is a continuum. The grouping of the не ка t 8 
“factor is in conformity with what in previous investigatio ep 
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been shown to be characteristic of the introvert-extravert dicho- 
tomy. Thus introverts have been shown to be more persistent, and 
extraverts to show less judgment discrepancy, and somewhat less 
suggestibility, as well as better dark-adaptation. None of these 
points will be stressed here as the study mentioned was only a 


preliminary one, using too few subjects to be convincing, and ` 


employing a selection of tests which was much improved in later 
work. It is given here only for the purpose of illustrating the 
method under discussion. 

At this point the critical reader may feel that we have jumped 
from the frying-pan into the fire. We have managed to get rid of 
the subjective element involved in locating our factor axes, only to 
link up. this rotation with the even more subjective criterion of 


psychiatric diagnosis. If our final aim were merely to obtain 


maximum correlation with the criterion, i.e. with the diagnosis 


‘normal’ or ‘neurotic’, then some form of multiple correlation ог 
discriminant function analysis would be indicated. Clearly, а 
further step is required. 

Let us look more closely at our criterio 
who have been thus diagnosed by a qu 
full consideration of their past work hist 


behaviour, phantasy life, and social adaptation, as well as their 
current symptomatology. ‘Normals’ are the undifferentiated mass 
of people who have not at any time of their lives been зо dia nosed 
and who at the time or the investigation are, leadin a tively 
normal lives and are not experiencing symptoms anaes rela me 
lead them to seek help from a psychiatrist, The ‘ne сеоце ud 
itself is of course not entirely homogeneous but EE E 
various ways. Some are bright, others are dull; ais e Т 
verted (hysterics), others аге introverted (dysthym : me are ke 3 
additional psychotic symptoms, others do not us some cnet 
agreement among experts that certain groups fall i SON 18 muc | 
field of ‘neurosis’, e.g. hysterics, anxiety states te the деле ы 
sions; there is no agreement at ай regarding ск i actiye depres 
the psychopathic states, or the obsessional and = groups, such as 
Some people would argue that ‘psych: mpulsive groups. 
from ‘nevrotics’, and should therefo t be differentiated 
group; others would claim that psych 


n. ‘Neurotics’ are people 
alified psychiatrist, after 
ory, illness record, sexual 


Clearly, our criterion is in a very doubtful resembling each otlier. 
of clarification. This can be applied by a 
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our principle of maximizing the correlation between actor (F3) 
and criterion (С), which leads us to a double maximization principle. 

_It is well known that the correlation between а test and a 
criterion can be increased in two ways: by improving the test, and 
by improving the criterion. Similarly, we can increase the correla- 
tion between factor and criterion column in two ways: by improv- 
ing our factor measurements, and by improving our criterion. So 
far, we have considered examples in which the criterion group 
consisted of neurotics suffering from those disorders and symptoms 
most commonly agreed to define neurosis: hysterical conversion 
Symptoms; somatic correlates of anxiety; feelings of depression, 
Worry, and fears unreasonably severe in relation to their causes; 
amnesias and other disorders of memory not caused by any obvious 
Physical agent or injury. This criterion may be only a relatively 
rough approach to a correct and perfect criterion, and this lack of 
Perfection in the criterion may be reflected in the failure of the 
Correlation between F, and C, to reach perfection. If that be so, 
then clearly any improvement in the criterion would result in an 
increase in the correlation between F, (always rotated into a 
Position of maximum correlation with C,) and Cy, determined 
now by the new criterion chosen. 

8 ап example, let us suppose that the large group of psycho- 
paths belongs functionally into the neurotic field, as is maintained 
à many psychiatrists. Then if we changed our group of neurotics, 

5 used in the experiment summarized in Table XI, to include 
qu percentage of psychopaths, we would presumably obtain 
€rent scores on all our tests for this new, enlarged ‘neurotic 
Sroup, and consequently a changed column of correlations between 
‘ests and normal-neurotic dichotomy (C’,). If the hypothesis that 
Psychopaths did belong to the neurotic group were correct, we 
would expect the correlation between F’, (rotated into maximum 
Correlation with the new C’,) and the new C^, to be as large or 
larger than the correlation of F, had been with the old С,. A drop 
in this correlation would indicate very clearly that our criterion 
ad not improved, but had become attenuated by the inclusion of 
this ‘foreign body’, and that consequently ‘psychopathy’ did not 
form a part of the neurotic criterion. The same procedure could of 
Course be gone through with other groups—obsessionals, epileptics, 
and so forth, in an attempt to obtain the maximum possible 
Correlation between criterion and factor. Given that the proper , 
criterion could be approached in this manner, a correlation of 
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near unity should be capable of achievement—as will be shown 
later, correlations of :9o and -95 have already been reached be- 
tween factor saturations and criterion columns. 

The essential points in ‘criterion analysis’ will now have become 
clear. We start with a given criterion for some hypothetical con- 
tinuum which we suspect, on purely psychological grounds, to be 
of general interest and importance. This criterion will in all prob- 
ability be very impure, attenuated, and inaccurate. We create 
measuring instruments which can be shown to give some rough 
measure of correspondence with the criterion; we then study the 
interrelations of these measuring instruments by means of factor : 
analysis, to obtain evidence of the existence of our hypothetical 
continuum, and to discover whether it is related in some rough 
way to our criterion. As has been shown early in this chapter, We 
may in this way actually transcend our criterion (the example of 
the teachers’ ratings of intelligence as opposed to the intelligence 
test will be remembered). We may therefore now use our factor 0 
improve our criterion, until we reach a position of exact equival- 
ence. When this is done, we may assume that our original hypo- 
thesis. was justified. This process may appear like arguing in 2 
circle, but in reality this method of argument corresponds precisely 
to the method used by physical science to define even ‘such 
eae и 5i ‘length’. We may with advantage quote 
eG aise ussell’s (1948) discussion of physical measurement at 

“Measurement, even of the distance to 
up from measurements of distances on the 
terrestrial measurements start with the 
bodies may be regarded as approximate 
your room, you assume that your foot- 
ably longer or shorter during the proc 
Expand determines most йс. by аша аше survey 9! 
cess demands that there shall be at least one Tun but this proj 
measured directly. In fact, a base line on Sali “Зо which 5 
chosen, and was measured carefully in the zie игу Plain was = 
which we measure the size of our room: idi rod way in 
take as bv definition of unit length, was реа which we may 
surface of the earth along a line as nearly straish У applied to the 
опе lensth*having been determined directly E t as possible. This 
the measurement of angles and by calculation: d Test proceeds by 
earth, the distance of the sun and moon, and 5 the diameter of the 

: even the distances of 


remove nebule, is built: 
surface of the earth, and 
assumption that certain 
ly rigid. If you measure 
rule is not growing appreci- 
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the nearer fixed stars, сап be determined without any further 
measurement of lengths. 

‘But when this process is scrutinized it is found to be full of 
difficulties. The assumption that a body is “rigid” has no clear 
meaning until we have already established a metric enabling us to 
compare lengths and angles at one time with lengths and angles at 
another, for a “rigid” body is one which does not alter its shape or 
size. Then again we need a definition of a “straight line”, for all 
our results will be wrong if the base line on Salisbury Plain and the 
lines used in triangulation are not straight. It seems, therefore, 
that measurement presupposes geometry (to enable us to define 
‘straight lines”) and enough physics to give grounds for regarding 
some bodies as approximately rigid, and for comparing distances 
at one time with distances at another. The difficulties involved are 
formidable, but are concealed by assumptions taken over from 
Common sense... . Common sense, assumes, roughly speaking, 
that a body is rigid if it looks rigid. Eels do not look rigid, but steel 
bars до... . Common sense, in so thinking, is Newtonian: it I5 
Convinced that at each moment a body intrinsically has a certain 
shape and size, which either are or are not the same as 18 shape 
and size at another moment. Given absolute space, this conviction 
ns meaning, but without absolute space it is prima facie meaning- 
ess... А 
. ‘As in the case of the measurement of time, three factors enter 
S first, an assumption liable to correctión; second, physical laws . 
Which, on this assumption, are found to be approximately true; 
third, а modification of the assumption to make the physical laws 
more nearly exact, If you assume that a certain steel rod, which 
looks and feels rigid, preserves its length unchanged, you will find 
that the distance from London to Edinburgh, the diameter of the 
earth; and the distance from Sirius, are all nearly constant, but are 
slightly less in warm weather than in cold. It will then occur to 
you that it will be simpler to say that your steel rod expands with 
heat, particularly when you find that this enables you to regard the 
above distances as almost exactly constant, and, further, that you 

. Сап see the mercury in the thermometer-taking up more space и 

Warm weather. You therefore assume that apparently rigid bodies 
expand with heat, and you do this in order to simplify the:state- 
ment of physical laws. B. А , 

‘Let us get clear as to what is conventional and what is physical. 
fact in this process. It is а physical fact that if two steel rods; 


- 
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neither of which feels either hot or cold, look as if they were of the 
same length, and if, then, you heat one by the fire and put the 
other in snow, when you first compare them again the one that has 
been by the fire looks slightly longer than the one that has been in 
the snow, but when both have again reached the temperature of 
your room this difference will have vanished. I am here assuming 
pre-scientific methods of estimating temperature: a hot body is one 
that feels hot, and a cold body is onc that feels cold. As a result of 
such rough prescientific observations we decide that the thermo- 
meter gives an exact measure of something which is measured 
approximately by our feelings of heat and cold; we can, then, 48 
physicists, ignore these feelings and concentrate on the thermo- 
meter. 16:18 then a tautology that my thermometer rises with an 
increase of temperature, but it is a substantial fact that all other 
thermometers likewise do so. This fact states a similarity between 
the behaviour of my thermometer and that of other bodies. 

‘But the element of convention is not quite as I have just stated 
it. I do not assume that my thermometer is right by definition; on 
the contrary, it is universally agreed that every actual thermometer 
is more or less inaccurate. The ideal thermometer, to which actual 
thermometers only approximate, is one which, if taken as accurate; 
makes the general law of the expansion of bodies with rising tem- 
perature as exactly true as possible. It is an empirical fact that, by 
observing certain rules in making thermometers, we can make 
them approximate more and more closely to the idea] thermometer; 
and it is this fact which justifies the conception of temperature as 4 - 
quantity having, for a given body at a given time, some exact value 
which is likely to be slightly different from that shown by any 
actual thermometer. 

_ "This process is the same in all physical measurements. Rough 
measurements lead to an approximate law; changes in th T 

А В e measu 
ing instruments (subject to the rule that all instruments for measur- 
ing the same quantity must give as nearly as possible the same 
results) are found capable of making the law more 1 асї. 
The best instrument is held to be the one that kc m ex 
nearly exact, and it is assumed that an idea] instr аз Ја 
make the law quite exact. ишеп wo 

«тр; 1 

This спар though it may seem complicated, is still not 
complicated enough. There is seldom only оце lae а 
very often the law itself is only approximat хонед 


ОР 5 9 
different quantities are interdependent, as we i sane E 
seen in 
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case of length and temperature, so that a change in the way of 
measuring one quantity may alter the measure of another. Laws, 
Conventions, and observations are almost inextricably intertwined 
in the actual procedure of science. The result of an observation is 
usually stated in а form which assumes certain laws and certain 
conventions; if the results contradict the network of laws and con- 
ventions hitherto assumed, there may be considerable liberty of 
choice as to which should be modified.’ 

The close similarity between this mutual interplay of rough- 
and-ready, common-sense observation, measurement based upon 
such Observation, and modification to give more precise inter- 
relations of measurements, with the procedure of 'criterion analysis 
will be obvious. Criterion analysis attempts to do in a formal 
manner in psychology what has always been done in the physical 
"Sciences, Our original hypotheses about temperature are denyed 
from ‘clinical? observation of our own feelings of hot and cold; 
this forms our original criterion. There is no doubt that this 
Criterion is inaccurate, fallible, unreliable, and сарое Опе 
Person's judgments of ‘hot? and ‘cold’ are not necessarily identical 
With another’s, and may deviate considerably; external factors 
Such as humidity play an important part in our judgment, as do 
internal factors such as our state of nutrition, or the amount of 
exercise we have taken. But roughly these feelings of ‘hot’ and ‘cold 
can be shown to correspond with certain physical phenomena, 
Such as the contraction and expansion of metals, the pressure of 
88565, or the emission of electrons from a heated surface. As these 
Physical Phenomena constitute an interconnected system of obser- 
vations which in turn fits in well with other observations (rigidity 
pr bodies, measurement of distances) the criterion is revised to 
Obtain the best possible fit (the highest possible correlation) be- 
‘ween all these sets of phenomena." 


* Tt should not be imagined that physical measurement of even such simple 
ич Well-understood variables as heat and length is without difficulties d 
sna analogous with those we experience in psychology. Thus а pye o- 
gical testing we often encounter the phenomenon of ‘threshold an СОЛЬ 
elects, i.e, of tests which measure effectively over a certain range, but either 
m to measure the variable in question at all outside this range, or oe i 
lassed or meaningless results. In physics, we encounter the same dii d: ty. 
us temperature is defined in terms of the resistance of a platinum wire зү 
— 183° С. to 660° C.; from 660° C. to 1063? С. it is defined and measured by 
the electro-magnetic force of a standard thermocouple. At even higher tempera- 
tures, definition and measurement are in terms of the intensity of radiation of a 


8.5.Р. G 
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In the same way, ‘clinical’ observations of certain types of 
behaviour gives us our original hypothesis regarding the existence 
of a continuum of ‘neuroticism’; these observations, regularized 
and formalized in legal and psychiatric practice, form our original 
criterion. There is no doubt that this criterion is inaccurate, fallible, 
unreliable, and impure. One person's judgments of ‘neurotic’ and 
‘normal’ are not necessarily identical with another’s, and may 
deviate considerably; external factors such as intelligence, or 
beauty, play an important part in our judgment, as do internal 
factors such as the state of our liver, or our general attitudes. But 
roughly these assessments of ‘neurotic’ and ‘normal’ can be shown 
to correspond with certain objective tests, such as body-sway 
suggestibility, or persistence. As these objective tests constitute an 
interconnected system of observations which in turn fits in well 
with other observations (employability of mental defectives, effects 
of prefrontal leucotomy, selection of nurses and students Eur 
zygotic and dyzygotic twin differences) the criterion is revised to 
obtain the best possible fit (the maximum correlation) between all 
these sets of phenomena, Empirical evidence for these statements 
will be given in later chapters; here we are concerned merely with 
the general theory underlying scientific methodology. 

The method of criterion analysis will become clearer as its 
various applications are «studied as they occur throughout this 
book. In essence, it is nothing but the application to 5, special 
taxonomic problems of human behaviour in its E e P Ya 
aspects of the general principles of scientific method a П-СО8) icol 
larly of the hypothetico-deductive method, It x lore par a 
hypothesis regarding the existence of a quantitative со Ч 05 
underlying a given area of human behaviour: it en олш Ч 
sidiary hypotheses regarding the nature of this con TON, hich 
will determine the tests to be used, and the Ns oe aa w. ЫН 
of the relations anticipated. - - - Deductions made fen ae, 
of hypotheses can be disproved (an example of this will be given 


defined wave-length from а ‘black body’. As Scott. pla; n~: 

interest lies in the fact that these conventions зше ров ош, ‘particular 
it so happens that pure aluminium freezes (“becomes slides ambiguity, since 
using the iesistance thermometer, thus falling within th ) at just over 660° С. 
thermocouple, whereas when the thermocouple і € official range of the 


Tmometer is required: 
accurate freezing point for aluminium’ (Scott-Blair ЭЭС унан to quote ай 
› 5 
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later in the book), or they can be supported with varying degrees 
of evidence by the results of specially planned experiments. We 
must now turn to the empirical evidence, to test the value of our 
procedures by application to a variety of hypotheses. 


Chapter Three 


THE NEUROTIC DIMENSION: OPERATIONAL 
DEFINITION 


hypothesis of the existence of a general factor of neuroticism ог 
stability. These terms, which are intended to denote the extreme 
points of what is conceived of as a continuum, will be used inter- 
changeably according to the particular direction implied in any 
particular case, just as one speaks of intelligence and mental 
deficiency depending on which of the two extreme points of the 
cognitive continuum one may be concerned with. A diagrammatic 
statement of the hypothesis is given on page 52; it suggests 
immediately the necessity for a formal disproof of what we may 
call the null hypothesis, i.e. the hypothesis that our two extreme 
groups (normals and neurotics) are not in fact objectively dis- 
criminated at all, but are merely chance and random selections 
from a homogeneous population. To many readers, disproof of the 
null hypothesis will appear merely pedantic and completely un- 
necessary, because this hypothesis goes so much counter to common 
sense and current psychiatric teaching; however, as we noted in 
our discussion of the evidence available with respect to the effects 
of psychiatric treatment, current beliefs are not always accurate 
guides to scientific knowledge, and a rigid proof must not be built 

on the shifting sands of hearsay and assumption. 
In citing evidence against the null hypothesis, we have grouped 


the studies quoted into four main divisions: Studies dealing with 


psychiatric ratings, studies dealing with questionnaires and inven- 
tories, studies dealing with objective behaviour tests, and studies 
3 


dealing with constitutional differences. In reviewing this evidence; 


we will at the same time have an opportunity of discussing hypo- 
theses regarding the traits which may be said to a n 
the neuroticism continuum, and to characterize the neurotic 
unstable end.as opposed to the normal, stable end. W 2 = 
able to review some of the evidence which sug з ae а mos 
ratings nor self-ratings can be relied on to ae ui ds nci 5 

Е 84 e reliable, 


I: this chapter we will begin to discuss the formal proof for our 
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Valid, and objective type of evidence which we are looking for. 
No attempt has been made to give a complete survey of the litera- 
ture. We have attempted rather to pick out important landmarks 
relevant to our main hypothesis, and characterized by proper 
research design, adequacy of statistical treatment, and use of a 
sufficiently large number of cases. The principle of selection was 
that one convincing demonstration is superior to a discussion of a 
hundred papers all of them deficient from one point of view or 
another; in cases of doubt, preference has been given to material 
otherwise not easily accessible. 


(1) Ratings and Psychiatric Diagnoses ; 

The first of the studies chosen to illustrate this section was 
Carried out by Reyburn and Raath (1950), and involved the 
rating by 83 observers of two subjects each. The total experimental 
Population consisted of 160 ratees, evenly balanced with respect 
fo Sex and University education. Inter-rater reliability ona few 
subjects who were rated by two observers was 806. Ratings were 


Оп a five-point scale, covering 45 well-defined personality traits; 


ГЕ actual ratings were gone over in each case with the rater by 
derstanding of the 


Ээ of the experimenters to ensure proper un 
Borles used, 

he table of intercorrelations was subjected to several methods 
oF analysis; of particular interest here is the oblique solution which 
resulted in six factors which were not independent of each other, 
and the correlations between which clearly gave rise to higher- 
Order factors. These were not derived by the writers, but were 
calculated from their Table V for inclusion in this book. Thirteen 
iterations were required before the communalities began тозсо 
verge. The results are very clear-cut, and are set out in Table XIII 
below. It will be seen that the first factor, which was rotated to 


TABLE XIII 
FACTOR SATURATIONS FOR ORIGINAL AND ROTATED SOLUTIONS 
I II Т Ir | La 
1. Spontaneit = :575 — +461 414. :387 
y —:230 575 Ё 

8, Stability м — 428 -072 -000 2:945 
3. Persistence :554 —:215 :592 "950 353 
4. Assertiveness 621 -789 210 :9/8 © roor 
5. Sensitivity дот Ei 


— :340 —-218 +163 
5 | 372 


6. Inferiority —-472 — 384 — 256 —:554 
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pass through ‘stability’, has a saturation of -972 for that item; this 
identifies it as the opposite end of the ‘neuroticism’ factor with 
which we are concerned. It is reasonable to find that persistence is 
closely associated with stability, and that assertiveness also has a 
positive projection on this factor. Inferiority feeling and sensitivity 
are understandably loaded negatively with stability, as is also the 
factor ‘spontaneity’. 

The second factor opposes ‘assertiveness’ and ‘spontaneity’ to 
‘inferiority’ and ‘sensitivity’; this falls in with our conception of 
introversion and extraversion respectively. We thus get a very 
interesting confirmation here for our view of ‘neuroticism’ and 
‘introversion-extraversion’ as second-order factors in the orectic 
sphere, corresponding to Thurstone’s intellectual second-order 


factor in the cognitive sphere. Figure 9 gives a diagrammatic 
representation of the results. 


ASSERTIVENES. 


SPONTANEITY 


SENSITIVITY 


INFERIORITY 


Figure 9 
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The other main research to be discussed under this heading is 
one reported in three papers by Mayer-Gross et al. (1949); Slater 
(1947) and Rao et al. (1949). A total of 201 neurotic and 55 
Mal officers were rated by a psychiatrist on thirteen main 
E aviour pointers’, which are defined at some length in the 

riginal publication. Ratings were confined to noting either the 
73 in or the absence of the particular trait to be rated. A brief 
ЇР ment of the traits used is given in Table XIV, together with 
E. Proportion of cases showing any of the traits in either the 
mal or the neurotic group. All the differences observed were 


5e TABLE XIV 
n " 21 и Гас. 
Personality ‘pointer’ rated К ине Шаша р pesti 
i | Neurotics | Normals E 
1, Heredit SM 2241 qe | - 
et 4 | cm 76 2:13 
3 Physica] ill-health 18 т | 23 Ps 218 
4. Folic traits in childhood 45 | "6 82 | 462 
5. Sh ег Psychiatric illness . СОР | ш 7| 345 
6. ЙА Solitary, etc. in childhood 40 ul 74 3:96 
7. Em culty in making social contacts | 33 -07 ба 3:28 
& ational instability “73 -09 141 4°42 
ð sessional features -37 13 67 174 
E pbPrehensiveness -59 -02 117 5:25 
ir, үү Pendence | 2 ЕЯ 105 161 
12, ОЕ Work record | +29 -07 43 Kp 
iage or sexual difficulti Е ds AM 
13. Alcoholism оо ине ЗЕ. ыг TE 57 
Er eee = : Шын кел = = ; 
assu Values of p’ Уп obtained directly from a matrix of 2 х 2 tables on the 


™Mption of a general factor and two group factors. 


shown to be significant by means of a chi squared analysis. Also 
= ven is а column headed ‘D’, showing the absolute differences 
11 Occurrence of each ‘pointer’ between the normal and the 
оно groups; this column will be referred to аз the ‘difference 
Column’, > 

. ^ table showing the frequency of concomitance of pairs of 
таш Was prepared, and factor analysed directly, i.e. without con- 
Version into correlations first, as would be the more usual pro- 
cedure. This analysis resulted in one prominent general factor, and 
two relatively unimportant subsidiary factors. The general factor 
saturations correlated 2 79 with the ‘difference celumn', thus 
leaving little doubt about the identification of this factor as one of. 
‘neuroticism’, or ‘Constitutional adequacy’, as the writers prefer to 
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сай it. The other factors are identified as ‘episodic instability’ and 
‘shyness’ respectively. Scores for these three factors were derived 
very simply by noting the number of pointers of each class for each 
individual, thus obtaining three scores (A, B, and C) for each 
individual. These three variables can be considered as defining a 
space of three dimensions so that any particular individual can be 
represented by a point in such a space. Groups of individuals, as for 
instance the clinically-diagnosed syndromes of obsession, hysteria, 
anxiety state, or post-traumatic personality change, can then be 


represented by a cluster of points around the mean value of that 
cluster. 


As the neurotic members of 
had been diagnosed in these у 
test the hypothesis that the m 
were collinear in the three- 


the sample used by the investigators 
arious categories, it was possible to 
can values of the groups in question 
dimensional test space; in other words, 
hese syndromes could be accounted for 


: € neurotic population in the hysteric- 
psychopathic (extraverted) obsessional V üntro- 
verted) type of personality, i t the main division 
Suggested by the 4 i 9 


Psychopathic group on the one hand 
group on the other. 


€, and only 5 in the 
the Suggestion may be 
original 13 scores into 
Ow existing differences 


group. Alternatively, 


rough condensation of the 
3 only, too much information was lost to all 


OPERATIONAL DEFINITION 89 


to appear clearly. Or again, the original choice of variables, made 
in order to maximize normal-neurotic differences, may not have 
been adequate for bringing out differences within the neurotic 
group itself. That one of these explanations is likely to be true is 
indicated by our success in differentiating between introverted and 
extraverted neurotic groups by means of psychological tests. 


IE 
ч OBSESSION 
PERSONALITY 
5 CHANGE ANXIETY 
NORMAL б9 STATE 
e Qh HYSTERIA N 
5 © 40 ! 
ro 20 50 PSYCHOPATH 
Figure 10.—Configuration of Diagnostic Groups 
TABLE XV Ё 
MEAN VALUES OF FIRST TWO CANONICAL VARIATES 
Group | Ay As 
4 | 
Normal “99 | 16 
Personality Change "79 | 36 
xiety State 2:22 23 
Hysteria» 12°32 A 
Psychopathy | 313 E 
Obsessional 3:36 | 62 


These ratings were made fost hoc, as it were, and dealt with 


Subjects who had already broken down; consequently they САҢ 
ardly escape the criticism that they do not furnish us with n 
ОГ predictive validity. Unless a rating can be made before break € 
Which will identify the potential neurotic the value of the metho 
must remain in doubt. Evidence on this point 18 given by two 
important studies Бу W. A. Hunt ef al. ( 1949), using what they 
call the ‘historico-experimental’ method. Their experiment was 
designed to test the hypothesis that Naval neuro-psychiatric 
screening was efficacious in lowering the subsequent rate of neuro- 
Psychiatric attrition during service, and was based oz the premiss 
that *the more neuropsychiatrically unfit individuals who are 
removed from a sample of recruits during the recruit training 
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period, the smaller should be the number of medical surveys for 
neuropsychiatric reasons among that sample during their later 


military service . . . if neuropsychiatric Screening performed its 


assumed function, an inverse relation should exist between the 
recruit screening rate at the Naval Training Station level, and the 
subsequent rate of neuropsychiatric attrition during service’. 
Three training stations were selected in such a way that recruit 
populations and professional psychiatric competence of interview- 
ing staff were roughly comparable; the major difference between 
the stations lay in the differential discharge rate for neuropsy- 
chiatric reasons, based on different attitudes towards psychiatry 


held by the Commanding Officers of these stations. The results of 


this study are clearly shown in Table XVI; it will be seen that in 
the stations which tended to act upon the psychiatrist's recom- 


TABLE XVI 


г 


; s 25 
Percentage Percentage Discharged 
N Discharged Percentage | z я 


Training Station” 


| 
dering ie arged | Subsequently by Years E 
Training J 1943 1944 1945 
Great Lakes | 1525 | 45 I y | " 
Aport 15544738) | 126 18 : | 4 3 
ampson | 2823 a d 3:0 6. | б 1:5 
| COE NL, _ a 
Total s 
Ss | 9521 a 22 24 5 7 DI 


stations are very similar. alth i f 
z : i 5 ough in one О 
em only half as ma d during training 


— 
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in World War II has been validated, and that the rational argu- 
ment for neuropsychiatric screening has been confirmed by experi- 
mental findings’. With this statement the present writer is in full 
agreement; the importance of these studies in proving the possi- 
bility of identifying the potential neurotic before he is exposed to 
the stress that leads to later breakdown can hardly be exaggerated. 
However, while it can thus be shown that psychiatric rating 
procedures have a certain amount of validity, complacency is 
hardly in order when we look at certain results published by the 
nformation and Education Division in the U.S. War Department 
(Stouffer, 1949). These results have a bearing on the question of 
25 bility rather than on the question of validity, except that 
Validity, in the long run, cannot outstrip reliability; as far as they 
БО, however, they show a picture of unreliability which must cause 
Опе to pause before accepting ratings as conclusive evidence in 
relation to the type of problem we are discussing here. A brief 
discussion of some of the data on which this judgment is based will 
now be given. 5.40 
During August, 1945, 42:3 per cent of all literate preinduction 
€Xàminees were rejected, 14 per cent for psychiatric reason 5:6 
Per cent were rejected with the diagnosis of ‘psychoneurotic , 39 
Per cent with the diagnosis of ‘psychopathic’, «3 per cent with the 
diagnosis of ‘psychotic’, and 4:2 per cent for other psychiatric 
seasons. When rejections are analysed by induction station (there 
Were fifty-five of these) the percentage of psychiatric rejections 18 
Ound to be very unstable, ranging from 55 and 3:8 per cent at the 


one end to 50-6 per cent at the other! In other words, for aee Hed 
inductee r ejected for psychiatric reasons in one centre, there a 


qr one hundred rejected in another. The plausible атаса 
ша perhaps the figures represent à faithful picture of the es 
Incidence among the populations drawn into these Модон in ps : 
Чоп stations js rejected by the writers of the роња о ЭМ e 
Convincing reasons why this is most unlikely. We are F ЧЭ 
unreliability of psychiatric assessment as the most pro а wee 
Among rejects, there are equally large differenc йы 
‘tations with г espect to the different diagnoses. With id с à n d 
39:9 рег cent being diagnosed as psychoneurotic оа: 
"ejects, the percentage varies from 2°] to 90 го per hei» о 
Stations! Percentage of psychotic diagnoses varies on 2 Ч 
`О per cent. Percentage of psychopathic diagnoses varies кош. 70:9 
О 2:0 per cent. Such differences again must be put'down to 
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unreliability of psychiatric classification, and while it is true that 
the psychiatrist was not given much time in these stations for 
arriving at a diagnosis, this handicap should not affect the relative 
incidence of different forms of disorder.1 | 
It might Бе objected that in the study just quoted, psychiatrists 
were working under a disadvantage, due to being under heavy 
pressure and not having sufficient time to arrive at an accurate 
diagnosis, as well as to being forced to use diagnoses уу 
not be in line with their special training or with their theoretical 
outlook. We may therefore quote briefly a study of psychiatric 
reliability in which the dice were, as it were, loaded in favour of 
the psychiatrist (Air Ministry, 1947). A number of air crew who 
had been referred for examination by a psychiatrist, had in actual 
fact been seen by two psychiatrists at intervals of a few weeks; both 
psychiatrists made a report on the case which included a three- 
point rating on his neuroticism (‘predisposition to neurosis’). 
Unfortunately, the two opinions were not independent ; the second 
psychiatrist had available to him some part of the opinion of the 
first psychiatrist to study the case, thus 
probability of agreement, Also, the у 
consideration (neuroticism) 
this sample than in an unse 
rated as having had severé 
frequently encountered in unselected 


hich might 


hand, presumably 


» 19429) should be consulted in this 
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conditions, are rather disappointing. Total misclassification (dis- 
agreements of one observer with the other) is 31 per cent. The 
Coefficient of contingency is -459; when this is corrected for the 
small number of categories in order to make it comparable with a 
product moment correlation, the value only rises to +534. This 
value is far below anything that would be considered acceptable 
1n connection with a psychological test, and when in other parts 
of this book correlations are run between psychological tests and 
psychiatric criteria, it must be borne in mind that even under 
favourable conditions this criterion has a reliability of only :534, 
and that consequently even a perfect test of neuroticism could not 


correlate higher than 4534 = -731 with this criterion. 

, While it might be objected that a diagnosis is not the same 
thing as a rating, it is in fact difficult to draw a line between the 
two; a rating of ‘emotional instability’ is often de facto equivalent to 
à diagnosis of *neurotic? in certain situations. On the whole, these 
arge-scale data bear out the widely-accepted and well-docu- 
mented view that valid and reliable evidence of the personality of 
the person rated is difficult to obtain by means of ratings, and 
should never be accepted without empirical verification (Vernon, 
1988). Ratings may give suggestive results, as for instance in the 
important attempts of Cattell (1946) to isolate the fundamental 
Source traits in the ‘personality sphere’, but as Cattell himself 
Would be the first to agree, it is not until we go from the sphere of 
ratings into the field of testing congruence of ratings with objective 
measures that our findings are worthy of general acceptance. 

atings, fundamentally, are the product ofan interaction between 
two Personalities—rater and ratee—and as long as the former 
Cannot be regarded as forming a constant part of the total situa- 
Чоп, so long will the final rating be partly an indication 2 a 

* This i i ing a perfectly reliable an 
Perfectly valid tor amd пазі chins Judgments are perfectly 
2 apart from their unreliability. In practice we can E pem is 
"On by using, not a single test, but a whole battery ч m E ы. 


from the intercorrelations of these tests a factor of пешо 


pue 

the limits : à iable estimate of a person's ‘true 
of nd reliable es Б : 

sampling errors, a valid a: to note that in a previous study 


Neuroticism Itisin М б T Hon 
. teresting, in this connection, to NO” 
along factorial lines the reliability of the psychiatrist’s jugara Pm E 
E the inverse of the above equation for the were Vote ker of г. 
'st’s rating, i.e. using the psychiatric rating's known vali 4 g 
Actor saturation to find He нусан reliability (Eysenck, 1947). The 
Validity was -ут, the reliability was «712 = 550, which agrees admirably with 
the figure found empirically in the R.A.F. study under discussion. 


PROPORTION RECEIVING 


E 
“12. A more d 
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rater's personality as well as reflecting to an unknown degree the 
personality of the subject who is being rated. 


(2) Pencil and Paper Tests 


(a) Questionnaires.—The evidence in this field shows quite clearly 
that under suitable conditions questionnaire responses can be relied on 
to give excellent discrimination between normals and neurotics. 
One example has already been given on page 56, where it was 
shown that fifteen separate questionnaire-type scales gave positive 
correlations of varying size with the normal-neurotic criterion; it 
was also shown that the intercorrelations of these scales followed a 
pattern which could be shown by means of criterion analysis to 
support the general hypothesis of a general factor of neuroticism or 
stability advanced here. The actual discrimination obtained by the 
use of a questionnaire based on the first of the fifteen scales des- 
cribed is shown in Figure 115 it will be seen that when a cutting 


EACH SCORE 


SCORES 


Figure 11 


score of 9 15 used, only 10:6 per cent of neurotics are misclassified 
im 8, and 28-6 per cent of normals misclassified as neurotics- 
Е use P p кое gives a slight improvement, but the 
in would not be sufficient to compensate fi ity of 
i tlm p or the complexity 


Similar results may be quoted from many different sources: 


Typical of questionnai in thi 

nnaires used in this connection 1 Mauds 

: : : 53 ae 
Medical Questionnaire, CXVIII 


€, which is reproduced below (Table ХУШ) 
Ж XVIII are given the scores (number of we борын x 
ie oa and 1,000 neurotic (discharged) members of H.M- 
A 68) these figures are shown in diagrammatical form in Figure 

f answers to individual items i$ 


: эд? ctailed comparison o 
iven in Fi i 
g igure 13, which shows, percentage endorsements О 
« 


OPERATIONAL DEFINITION 95 


normals and neurotics respectively for the sixteen most discrimin- 
ating items of the scale. 1 


TABLE XVII 


THE MAUDSLEY MEDICAL QUESTIONNAIRE 
NEM... а erccvespsccrnivrvessenscnca АВМУ МО ERR E 
Ly, Read through these questions and underline the correct answer, either 
es’ or ‘No’. Do not omit any item. 
It is important that you should be quite frank. 


(1) Do you have dizzy turns? Yes No 
(2) Do you get palpitations or thumping in your heart? Ya R 
es No 


(3) Did you ever have a nervous breakdown? 


(4) Have you ever been off work through sickness a good deal? Yes No 

(5) Did you often use to get ‘stage fright’ in your life? Yes No 

(6) Do you find it difficult to get into conversation with strangers? Yes No 

2 Have you ever been troubled by a stammer or stutter? Yes No 

(8) Have you ever been made unconscious for two hours or more X 
es 


by an accident or blow? 
(2 Do you worry too long over humiliating experiences? Yes No 
0) Do you consider yourself rather a nervous person? Yes No 


(1 1) Аге A z Yi No 
your feelings easily hurt? 2 са 

is Do you usually keep in the background on social occasions? Yes No 

13) Are you subject to attacks of shaking or trembling? ын RE 
es 


1 hoe 
4) Are you an irritable person? 


и) Do ideas run through your head so that you cannot sleep? Yes нэ 
( ) Do you worry over possible misfortunes? Yes 9 
17) Are you rather shy? 8 Yes Мо 
(18) Do you sometimes feel happy, sometimes depressed, without 
(1 апу аррагепї геаѕоп? ? Үе: ЫН 
(9) Do you daydream a lot? Yes No 
(29) po you seem to have less life about you then others? S Nó 
(22) De you sometimes get a pain over your heart? Үе Мо 
(23) De you have nightmares? Yo No 
(24) О you worry about your health? : Ye Мо 
24 Науе you sometimes walked in your sleep? Yes No 
5) Do you sweat a great deal without exercise? Yes No 


(26) Do you find it difficult to make friends? 


(27) Does your mind often wander badly, so that you lose track of з. UNO 
(28) What you are doing? Yes No 
(29) = you touchy on various subjects? Yes No 
ies Do you often feel disgruntled? Yes No 
) Do you often feel just miserable? iors? | Yes No 
3) ре you often feel self-conscious in the presence of superiors: SONS 
О you suffer from sleeplessness ? H 
33) os ever get short of breath without having done heavy Yee. No 
Work? 
: Yes N 
(34) Do you ever suffer from severe headaches? 5 Yes No 
ER АО you suffer from ‘nerves’? e . "^ Yes No 
€ you troubled by aches and pains: к 57 
(87) Do you get ders fs places ши as lifts, trains ог tunnels?. _ ts No 
89 Do you suffer from attacks of diarrhoea? Xs NC 
39) Do you lack self-confidence? Ta Yi No 
4 € you troubled with feelings of inferiority: a 


= 
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Li 
TABLE ХУШ 
Score Normal | Neurotic 
0-2 118 | 55 
8-0 171 34 
6-8 187 | 2 
9-11 165 66 
12-14 120 | 80 
15-17 101 81 
18-20 65 96 
21-23 | 2 | 123 
24-26 21 | 12. 
27-29 12 | 144 
30+ І | 145 
| | 


Ауегаре: 9:98 20:01 


SCORE 
0—2 
5—5 
6— 8 
9—u 
12 — 14 
15 — 17 
18 — 20 
21 — 25 
24 — 26 
27 — 29 
30+ 


1000 NORMALS 1000 NEUROTICS 


Figure 12.—Distribution of Scores 
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Administration has also published some of the results achieved 
through the use of neuroticism questionnaires; Figure 15 shows a 
typical picture of the differentiation achieved between normal and 
neurotic merchant seamen (Killinger, 1947). There is little advan- 
tage in multiplying instances; it is hardly possible to deny that 
unselected subjects make scores markedly different from those 
made by subjects diagnosed as ‘neurotic’ by psychiatrists, and 
separated from the service. 


100 со 
Well-adjusted 4 
58 juste | A 
——-—- Doubtful neurotic 451 
80] ------ Discharged neurotic 250 


70 
60 
50 
40 
30 
20 


CUMULATIVE PERCENTAGE 


50 28 26 24 22 20 18 16 14 m 102 88 6 4 2.10 
TEST SCORE 


Figure 14 


у hat neurotics give answers 
if that were so it would not be: 
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questionnaire results (Eysenck, 19475). But as we have shown in 
de preceding section, the validity of psychiatric ratings for future 

reakdown has been established; it would seem to follow that 
methods correlating highly with psychiatric assessment would also 
show a certain degree of validity. 


40 « 


55 


E 
5 $ 


PERCENT FREQUENCY 


15 55 9-5 15-5 17:5 215 255 19:5 535 37-5 AUS 455 495 555 575 615 65:5 
INVENTORY SCORES 
Figure 15 
aire results within an unselected 
Population can be shown to be related to independent variables 
Such as age and education in precisely the same way in which 
these variables are related to the incidence of neurosis. ‘If we 
examine the age-education distributions of the cross section of 
white enlisted men... and the hospitalized psychoneurotics 
. from among their number . . . it is at once apparent that-there is 
an excess of psychoneurotics in exactly those classes in which the 
Anxiety Symptoms Index (the questionnaire used in these studie: ) 
was shown to select relatively higher proportions—among older 


In the second place, questionn 
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men and among the less well educated men’ (Stouffer, 1949). This 
striking agreement argues strongly in favour of the general 
validity of the questionnaire results. 

In the third place, we have in addition to such indirect evidence 
as has been presented above, direct evidence regarding the fore- 
casting efficiency of questionnaire items, and followed up over a 
period of six months. А sample of 73 men who had had a ncurotic 
breakdown subsequent to their test was compared with a sample of 
730 men who had had no such breakdown subsequent to their test, 
and who were matched with the ‘breakdown’ group for age, 
education, and marital status. The results of this experiment 
(Stouffer, 1949) show very clearly marked differences in question- 
naire responses between the experimental and the control group. 
This result is all the more interesting as typical neuroticism ques- 
tions were intermingled with questions dealing with у 


arious aspects 
of morale; the former (e.g. E 


ЖЭ ‘In general, how would you say you 
feel most of the time, in good spirits or low spirits? or ‘What kind 
of physical condition are you in?) were far superior in differentiat- 
ing the two groups than were the latter (e.g. ‘In general how well 
do you think the Army is run?’ or, ‘In general what sort of time do 
you have in the Army?"). It seems clear, then, both from our dis- 
cussion of the indirect evidence and from the к i чл 
quoted, that questionnaire responses do not onl ceder 
between acknowledged neurotics and normals eni j i dem i 2 
responses сап be used to single out the prospectiv also t at = E 
a group of unselected recruits. e neurotic iro 
This general conclusion is in а 


ables when the d are adequately ologically significant d 
responses.’ In a similar vein, Ellis (1946 give can 

of оше four hundred veliti seat) уне his review 
applications of personality inventories Һауе yield ae Military 
able results to command attention. In contrast, рс cnough favour- 
tories in civilian practice have generally Proved dp ity ven 
Regarding the use of questionnaires in civilian - isappointing. 

that ‘the older, more conventional, and more va Y os he holds 
these tésts seem to be, for practical diagnostic ely used forms of 
worth the raper on which they are printed Purposes, hardly 
scatement he presents a summary table showing Support of this 
studies in non-military groups very frequ 8 AN validation 


е 
ntly show ‘negative 
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validity? : 
d Iso dee ie to be his somewhat surprising way of 
Н Nnm B wd coefficients are below + :40. While his use of 
усш d ын is unusual—a validity coefficient of + -69 
that he hist 1 y rs “questionable positive'—here is little doubt 
work are less ela А іп showing that questionnaires in civilian 
This еба 1 үлеп in military research. А . 
thie “оог results obtained with questionnaires underlines 
БЕДЕР conditi results on the motivation of the subjects, and the 
"олон л under which the testing is carried out. As long 
so long will pem nor conditions can be adequately controlled, 
Dres “as, anv е impossible to regard questionnaires and inven- 
regarded with шы! тайга than supplementary evidence, to be 
set of oa unless shown to be valid in the particular 
ider биге eu which happens to be under investigation. 
questionnaires e eee as the figures quoted above show, 
under less мыт е of very great utility and scientific value; 
Oath, егеу}. rable conditions, validity coefficients may approach 
lee How a sc е negative. As in the case of ratings, it is difficult to 
such an entific analysis of human personality can proceed on 
unreliable, shifting basis. 


315 Neurotics—both sexes 


E 
200 Original standardization group 


ооо Middle-class sample—both sexes 


233 Middle-class sample—both sexes 


143 Industrial workers—male 
126  Nurses—female 
[M=10-7 Jo =61 _] 


341 Industrial Workers 
Members of Guards Regiment— 
ЭВТЭЙ 5 

Navy continuous service person- 


[M=irg To 56019 
ncl—male 


2,562 Total Sample a - 


Figure 16.— Description of Sample 
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(b) Word Connection List.—This test, which is essentially an 
‘alternative choice’ version of the word association test, is repro- 
duced in Table XIX. Of the two responses offered for each 
stimulus word, one is more frequently chosen by neurotics, the 
other by normals (Crown, 1948). Total score is the number of 
neurotic responses underlined. The validity and reliability of this 
test has been established on large numbers of different populations, 
and representative means and standard deviations are given in 
Figure 16 to illustrate typical results. These values, which are taken 
from Crown's (1951) paper summarizing work on this test, show a 
marked difference between the neurotic and the normal groups. 
Further results will be quoted later on to substantiate the claims 
made for this test. 

(с) Annoyances, Interests, Worries, etc—Many pencil and paper 
tests make use of lists of items which might be considered annoy- 
ing, worrying, interesting, or in some other way related to the 
emotional life of the subject; the number of items underlined or 
crossed out is then taken as the score. This type of test pioneered 
by Beesey an his Х-О test, is less direct than a questionnaire, but 
more direct than the word connection list; consequently its useful- 
ness would appear to lie between these two other methods О 
eliciting information through pencil and paper responses. The 
experimental study to be described was carried СҮН Bennett 

(1945) and Slater (1945),0n 80 normal and 80 neurotic subiects 
The following tests were used: ч тойс subjects. 

т) A neurotic inventory divided into t : 3 
ЯГ ‚бе m related to the clinical S ee E) 
and depression. (2) An annoyances test, listin НЕД ү 2 
annoying stimuli or situations of four kinds, fi bd possi à 
whole in random order. These four types of hn. each, t e 
are: (a) frustration of self-assertion, (b) persona] mh situation 
dirt or untidiness, (9) noise. (3) Three sections of the tea 2 
test, modified from the original, and dealing with. (а) ressey X— 
which an individual should be blamed, (b) things е iner for 
has ever felt worried, nervous or anxious, (c) items ЛЕ Pus he 
or in which he is interested. Scores on these ten ier he likes 
obtained, and intercorrelations established for the aay were 
separately, ds well as for the combined group of Pid groupa 
neurotics. P3serial correlations were calculated between шоо an 
neurotic dichotomy and each of the ten tests; th € normal- 


: ese are reproduced 
‘D’ in Table XX. Also given are first-fac produce 
under tor saturations for 


` 
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TABLE XIX 


WORD CONNECTION LIST 


SURNAME 


follow. = 
ed Бу two words in small letters. Look at the word SINK in the example 


glance at the two other words. Does the word SINK make you 


think m 
ore of * ror й = 5 
wash’ or of ‘drown’? Draw a line under whichever word is more 


connected in your mind with SINK. 
EXAMPLE: 


5 
En ome people connect *wash* 


no ri; 
апо 
underline 


SINK 
lows: with SINK and so they underline ‘wash’ аз 
еЗ wash drown. Other people connect ‘drown’ with SINK and 

cy underline ‘drown’ as follows: SINK wash 


Hu 
Ы Ч J dinde answers because one 
st look at the two words that follow the word in capital letters and 


the one that you fecl is more connecte 


wash drown 


drown. There are 
word connection is just as good as 


d in your mind with that word. 


ORK Е 
K FAST. Don't stop to think long about any опе word. Be sure not to 


сау 
с any words out. 


50188085 nurse cut 
teri fect moist 
D yell soft 
OMAN girl 


t 
LION й rouble 


^ sentence 
5 Sad sky 
ee food ache 
THO home right 
Яс UGHTS ideas 
BO w beware fast 
Ris girl mischief 
oxen little tall 
er happy discontented 
UNEA. shameful wand 
Т РРҮ по усз 
IY dry drink 
SINE drinking fasting 
useless good 
EM weight heart 
ROUBLE lawyer 
DEEP ocean hurt 
FRIEND  double-crossed 
FOOT hand tingle 
MAN work woman 


strange 


sorrow 


close 


WEIGHT scale losing 
DIGNIFIED snobbish poised 
TALKED spoke about 
SLEEP nightmares bed 
RIVER lake danger 
BABY foundling little 
LOSE find mind 
SALMON dislike like 
HUNGRY thirsty heart 
MAN hard boy 
MUTTON eat flesh 
SWIFT hurricane slow 
BRING take disaster 
SWEET affected bitter 
FOOD stomach poisoned 
RAW deal meat 
PARTY crowd myself 
BITTER medicine swect 
GRAVE serious funeral 
WOMAN excitement man 
SOUR lemon stomach 
САМ”Т concentrate fly 
PINT quart whiskęy 
NEEDLE drug sharp 
WOUND  bandages icelings 


Remember: There are no right or wrong answers. Go as fast as you can 
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the normal, the neurotic, and the combined groups. Statistical 
tests showed the feasibility of combining the two groups, by com- 
paring the hypothetical matrix derived from the combined ш 
with the correlation tables for the separate groups (f = “10 in cac 
case). і 


ТАВІЕ ХХ 


First Factor | 


чу 
| Neurotics Normals | Combined 

Inventory: Anxiety | -77 | о | 37 2 
Hysteria 49 | *55 | 53 43 
Depression -78 75 70 7° 
Аппоуапсез: а 18 03 13 :25 
b ao | we | му 49 
с 15 | '21 | 13 “10 
d 63 | 48 “54 “72 
Pressey X-O: a е -06 mE eds 319 
Е 33 55 "49 :65 
с °9 2-2 -18 +56 


It will be seen that there is a high correlatio 
criterion column (‘D’) 


whether separate or с 
This fact may be tak 
the hypothesis of a g 
fact that the two mat 
subjects show patter 

Standard deviat 


n between the 
and the first factor saturations of the groups, 
ombined (r — -64 for the combined values). 
еп as another indication of the correctness of 
enera! factor of ‘neuroticism’, as can also the 
rices derived from the normal and the neurotic 
ns of'intercorrelations substantially identical. 


ions of summed weighted standardized scores 
were estimated separately for neurotics and r 


normals, giving values 
of 2-17 and 1-74, an insignificant difference statistically. As Slater 
comments, ‘this is an interesting finding. If neurotic personality 
were a unitary characteristic persons selected for the possession of 
it would have more homogeneous scores of tests which measure 1t 
than unselected persons; instead they are found slightly, though 
not significantly, more heterogeneous. Here we have an argument 
independent of criterion analysis which essentially leads to the 
same conclusion of the quantitative nature of neuroticism, 25 
opposed to the qualitative concept. i 
(d) Food Aversions—The common psychiatric observation that 
neurotics terd to have special emotional attitudes to food has been 
quantified hy Wallen (1945) in the form of a food aversion list. A 
list of twenty foods is read out to the subjects, and aversions, 48 
well as the reasons for dislikes, are recorded, Alternatively, printed 
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questionnaires may be used; there appears to be little to choose 
between the two methods. Wallen has shown that neurotics report 
on the average a significantly larger number of food aversions than 
do normal recruits. 

Gough (1946) applied this list to 79 neurotic and 254 normal 
soldiers, and found mean number of aversions of 5:14 for neurotics 
and 1-23 for normals, a very significant difference statistically. 
Less than 5 per cent of the normals disliked more than three foods, 
while about 70 per cent of the neurotics did so. Table XXI gives 
the actual food names used, the percentages of normals and 
neurotics disliking each, and the CR. of each difference. The most 
Interesting feature in this table from our point of view lies in the 
fact that the dislikes of normals and neurotics are highly correlated 
(rho = -82); in other words, foods disliked by neurotics are also 
disliked by normals, but not to the same extent, and similarly, 
foods liked by normals are also liked by neurotics, but not to the 
same extent. Here, then, we have again support for the quantita- 


tive view of the difference between normals and neurotics. 


TABLE XXI 


PERCENTAGE OF AVERSIONS FOR NEUROTICS AND NORMALS 


ON EACH OF TWENTY FOODS 


Percentage of * Percentage of Critical 
Food Neurotics Normals s Rilo 
Disliking Disliking 
Tea =. go 45 
Grapefruit juice iba 43 йо 
сап зоцр 34-2 55 6: 
Potato soup 15:6 53 8:9 
Salmon 31:6 8-7 541 
Beefsteak r3 27 1:8 
(cal chops 20:3 24. 5:6 
hicken 8-9 12 3:5 
Fried eggs 16-4 1:2 59 
Cottage cheese 51:9 14:6 g 
Swiss Cheese 36:7 8-7 19 
ime beans (broad beans) 240 6:7 43 
abbage 354 12:2 47 
orn Ee 1:6 43 
Mushrooms 53:2 25:2 E B 
Pane 20-6 08 |. "E 
Omatoes 11:4 24 s 34 
antaloupe 13:9 3:5 s а 4 
herries 13:9 0 0 
Ав, 8-0 8 3:9 
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(3) Objective Behaviour Tests 


In a sense, the major part of this book deals with objective 
behaviour tests of one kind or another, and it may seem Nu 
to add a special section dealing with them here. However, ast 3 
type of test is still relatively unknown to many psychiatrists d 
clinical psychologists, and neglected almost completely in t A 
practical work of the clinic, the hospital, and the workshop, 1 
seemed worth while to present briefly some of the main иш 
ments this type of test has received Бу psychologists like Luria ап 
Davisin the field of motor control, Hull in the field of. suggestibility, 
Ryans and others in the field of persistence, and various members 
of the writer's department in several other fields. No attempt will 
be made to duplicate the summaries given in Dimensions of 
Personality of these various areas ; only one major research will be 
quoted in each case to establish landmarks, as it were, to guide 
later interpretations of factorial studies. 

(a) Suggestibility—It has been shown elsewhere 


that the most convenient, most reliable, short test of this personality 
trait, established in two factorial stud 


ies, was Hull’s Body Sway 
test. In this test the subject, standing with his eyes closed, his hands 
hanging down by his side, and his feet together, is made to listen 
to a gramophone record which repeats 


the words: ‘You are falling, 
you are falling forward, you are falling forward all the time. You 
are falling, you are falli 


ng, you are falling now. .. 2 His maximum 
forward or backw 


(Eysenck, 1947) 


the amount of sway, and the 
» Increases with an increase in the 


amount of body sway in inches 
the number of cases in each 
li each column. It will be seen 
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severe neurotic groups swayed 5:55 and 6-72 inches respectively 
for the two sexes. This study is but one of several showing a strong 
correlation between neuroticism and suggestibility. In none of the 
studies, incidentally, were any marked differences noted between 
hysterics and other types of neurotics. 


7 
6 
5 


4 


SUGGESTIBILITY: 


ot 
960 NORMAL I п nt улуу, yt 
MALES (N-60) (N-54) (N-132) (N=247) (N=244) (N-154) (N=69) 
6.72 


SUGGESTIBILITY: 
© 


" vr 

390 NORMAL d... DOES 80220475 ane 
FEMALES: 08:80) (N=13) (н=54) (N90) (N=100) (nasa) (219) 

Figure 17.—Average suggestibility of Nc 
Increase in suggestibility correlated wi 


ormals and Neurotics, showing 
€ ЕЕ 
th increase in *Neuroticism 


(b) Manual Dexterity.—W hile tests of this type have usually been 
regarded as tests of ability pure and simple, it has beeni shown that 
Very large and significant differences сап be observes between 
normals and neurotics with respect to Various tests in ‘his group, 
such as the О?Соппог Tweezers test (Eysenck, 1950), for instance, 
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or the U.S.E.S. tests (Lubin, 1951 )-: Correlations E a яс 
normal dichotomy are in the neighbourhood of +. ч 0. єг "i wt 
also appear to be deficient in these tests, but their pa vr be: 
disability is very different from that of the neurotics, as wi 
эс там Aspiration: The general technique of level of мэ 
tion tests requires the repeated performance ofa task (раша y a 
mechanical one) which can be carried out with varying ск 
speed, accuracy, or goodness; it is essential that the váriable саяр 
should Бе expressible in numerical terms (number of pue | 
number of errors, etc.), and that the subject should not be ap 1 
from his performance to estimate with great precision his actual 
score.* The subject is made acquainted with the task; he is b 
asked to give an estimate of how well he expects to do on his nex 
performance. (This estimate must be made in quantitative terms, 
of course.) This estimate is called his ‘aspiration score’. He then 
performs the task, and is asked to give an estimate, again И 
numerical terms, of how well he has done; this estimate is calle 
his ‘judgment score’. He is then told what his actual score on the 
performance was (his ‘performance score’), and asked to estimate 
‚ how well he will do next time. This procedure is repeated a aa 
of times—usually from five to ten repetitions are required to Бе” 
reliable data. Two main scores are derived from these data: 
t pancy score. This is based on the difference 


rmance on trial 
(X + 1); it is positive if the as 


1 These and several other tests used at various times are illustrated 20 
photographs 19, 20, 23, 25, 26. 


2 The most frequently used task in our 
illustrated in pho 


handwheel so as 
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linear rather than linear. We find that when the neurotic sample 
is broken up into the introverted (dysthymic) and extraverted 
(hysteric) groups, these groups show large differences. Introverts 
have high levels of aspiration (high positive goal discrepancy score) 
and severely underestimate their past performance (high nega- 
tive judgment discrepancy scores). Extraverts have low levels of 
aspiration (low positive or even negative goal discrepancy Scores) 
and show no tendency to underestimate their past performance— 
indeed, they may overestimate it (low negative or even positive 
Judgment scores). Normals in either case are intermediate between 
these two extremes, having medium high positive goal discrepancy 
Scores and medium high negative judgment discrepancy scores 
(Eysenck, 1947; Himmelweit, 1946; Miller, 1951). Figure 18 gives 
a schematic drawing of these relations. It follows from what has 


- been said that differences between a normal group and a neurotic 


group would depend on the proportion of hysterics and dysthymics 


HYSTERICS 4 NORMAL! NORMAL DYSTHYMICS 
lEXTRAVERTSIINTROVERTS 2 
NN uin 
Мєн 0 "LOW MEDIUM |. HIGH 
TIVE POSITIVE POSITIVE POSITIVE 
Goal Discrepancy Scores. 
HYSTERICS NORMAL | NORMAL N DYSTHYMICS 
«T EXTRAVERTS ! INTROVERTS “р 
Low о LOW MEDIUM HIGH. 
> POSITIVE NEGATIVE NEGATIVE NEGATIVE 


Judgment Discrepancy Scores. 
Figure 18 
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in the neurotic group: unless this is controlled itis difficult to make 
т diction. ^ ^ л , 
Cun odi do normals and neurotics differ with respect to actual 
scores; they also differ with respect to the pattern of scoring. When 
goal and judgment discrepancies are correlated, these correlations 
tend to be positive for normal and negative for neurotic groups. 
As an example, we may take the following correlations, based on | 
male and female groups respectively: + -59 and + :39 for normals, | 
and — -68 and — :39 for neurotics. These differences are presume 
ably due to the peculiar curvilinear relation between neuroticism 
and the discrepancy scores. 7 

(d) Motor Response Disorganization—The view that under 
emotional stress there is a disorganization of motor processes, and 
that this disorganization will be more noticeable in neurotics than 
in normals, is not a novel one, and may be traced in its experi- 
mental development to the early work of Nunberg at the time of 
the First World War, who combined the Word Association test 
with measurement of hand movements of an involuntary kind 
(1918). 
. These early and technically somewh 
were taken up by Luria (1932), 
known to be detailed here. Alth 
Jones (1931), Crosland (1931), Huston et al. (1934), Barnacle e al. 
(1935), Houtchens (1935), Runkel (1936), Burtt (1936), Krause 
(1937), Gardner (1937), Shuey (1937), Speer (1937). Sharp 
(1938), Reymert (1938), Morgan and Ojeman (1942), Albino 


(1948), and particularly Clarke (1950), have shown the essential 
correctness of the general thesis thr 


the example given here is { 
what different work of D 


at unsatisfactory дээ 
in a series of researches too We 
ough others, such as Olson an 
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n those of an aircraft, but the test was apparent; 
subjects as an exercise in instrument flying. ШЗ, 
d of a series of four Inànceuvres, together occupY. . 
3 » repeated between intervals of straight and leve — 
flying. Ошу the periods of manœuvre Were scored; instructio? d 
and recording were so arranged that a perfect performance b 
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recorded. аз а straight line, any deviation indicating an error. 
Other detailed records of performance which were necessary were 
also made. 

Two types of error were observed; errors of overaction and errors 
of inertia. Subjects exhibiting overactivity ‘obtained large scores on 
control movements. Their errors in instrument reading were small 
and of short duration. ... Responses to instrument deviations 
were excessive, the extent and gradient of the movements being 
greatly increased and over-correcting frequent, with the result that 
secondary responses were required. Numerous restless movements 
were observed. . .. Subjects felt excited and under strain, tense 
and irritable and sometimes frankly anxious. They felt that correc- 
tion was urgent and made it impatiently. . . . Although... sub- 
jects were dissatisfied with their performance, they were not dis- 
couraged, but keen to improve and continued to try to do the test 
well. 2 Several pilots reported preoccupation by the test for 
some time after it was finished. Some returned later in the day and 
asked for a further opportunity of doing the test." 

Subjects exhibiting the inertia reaction made ‘errors which 
were large and of long duration, whereas activity, represented by 
scores of control movements, was relatively little. . . . The indi- 
vidual responses were less hurried and less disturbed by restless 
movements than were those of the pilot: showing the overactivity 
reaction, but they were often more extensive than at the beginning 
of the test, due probably to the larger size of the instrument devia- 
tions and the tendency to make responses proportionate to the size 
of the deviations. .. . Subjects reported that their interest had 
flagged and that their concentration had failed. A feeling of strain 
had now given way to one of mild boredom, tedium or tiredness. 
. . . In contrast to the restless striving of the overactive class, the 
pilots in the inert class gave the impression that they had lowered 
their standards of performance to a level well within their powers. 
- ». Subjects’ judgment of the degree of accuracy to which they had 
attained was usually faulty, and they were unaware of the degree 
to which they had failed to correct deviations of the instruments. 
... There was... ап emotional indifference as far as the test 
itself was concerned.’ А й Ё 

These results are closely in accord with Nunberg's observation 
in his original work, which led him to posit the existence of two 


- types of individual, the one showing inhibition under affect (the 


inertia reaction), the other exhibiting increased excitability (the 


112 THE NEUROTIC DIMENSION: 


overaction response). They also link up remarkably all жи 
experimental work on the extravert-introvert dimension repor * 
in Dimensions of Personality (Eysenck, 1947) showing that mds : 
(like the overactive response type) tend to show anxiety, ын ү 
irritable, to have a high level of aspiration, and to be dissatis x) 
with their performance. Extraverts (like the inertive response type е 
tended to show emotional indifference, to have low levels Е 
aspiration, adjusted to actual performance, and to be satisfie 
with their performance, however poor. f 20 
"These experiments thus give rise to two main hypotheses whic 
admit of proof or disproof: (1) Pilots who are neurotic should show 
a larger proportion of abnormal (overactive or inert) reactions 
than pilots who are not diagnosed as neurotic, and (2) pilots who 


' are diagnosed as suffering from dysthymic disorders (anxiety state) 


should show the introverted, overactive behaviour pattern, while 
pilots suffering from hysteric disorders should show the 2 
verted, inert behaviour pattern. Both these hypotheses were teste 


by Davis on 355 normal and 39 neurotic pilots. Taking the com- 
parison between normal and neurotic pilots first, it will be seen 
from Table XXII that both predictions are verified: 75 per cent 0 


TABLE XXII 
TEST RESULTS OF NORMAL AND NEUROTIC PILOTS 


Y Reactions Total 
N 
Normal Overactive Inert 
ВЭ? 
No. 9 No. 9 2 9 
Normal Pilots 268 5 5 59 5 28 % 355 
Neurotic Pilots 13 33 її |. 98 15 38 39 
Acute Anxiety State 6 43 7 20 I 7 ү} 
Нузїепа I | 12:5 I 12:5 6 75 5 
Other Pals souls | т 8. iu Е 


the normals show normal reactions, 
neurotics do so. 
while 75 per cen 
ally significant. 
division would 
of subjects tó 
experiment, 
gave results 


but only 33 per cent of the 
7 per cent of anxiety states show inert reactions; 
t of hysterics do so. These conclusions are statistic- 
The number of neurotics is зо small that any sub- 
appear unlikely to contain sufficient large numbers 
make conclusions very impressive, Repetition of the 
using a modified and much simpler type of apparatus, 
which show again, at an acceptable level of statistic? 


9. Dark-vision test. 


10. Cambridge Triple Tester, used for Level of Aspiration experiments. 


12. Tracing board. 
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significance, that the acutely anxious (introvert) type of patient 
tends to make more extensive responses than did the normal 
subject, while the hysteric (extravert) patient tends to make less 
extensive ones. This apparatus is illustrated in photographs 13 and 
14. The task consists of aligning the central pointer with the line at 
the right or at the left, according to the brightness of two lights 
flashed on at both sides. Movement of the pointer is mediated 
through an integrating disc, and is produced by turning the hand- 
wheel. Provision is made for automatic recording of movements, 
time sequences, etc. (Of. Davis (1948).) 

Our hypothesis of a normal-neurotic continuum would also 
require that within the normal group there should appear a correla- 
tion between neurotic predisposition and percentage of abnormal 
responses. The percentage of abnormal responses made by the 355 
normal pilots is given in Table XXIII after they had been divided 
in three groups according to psychiatrically assessed degree of 
neurotic predisposition; it will be seen that there is an increase in 
abnormal responses from 16 per cent in the group showing least 
predisposition, through 27 per cent to 46 per cent in the group 
showing the greatest amount of predisposition. Thus this third 
hypothesis is also confirmed by Davis's results at a reasonable level 
of statistical significance. We may take it as established, therefore, 
that under emotional stress there is aodisorganization of motor 
Processes, a disorganization which is closely correlated with 
neuroticism or lack of emotional stability. 


TABLE XXIII 
ASSOCIATION OF NEUROTICISM AND TEST SCORE 


Predispositi Percentage 
сонц il Normal Overactive Inert “Abnormal 
ei ү 130 13 12 16 
Mod, t 113 32 D 27 
oderate 25 14 7 46 


(e) Body Control.—Several experiments on static ataxia, the 
eath rail walking test,! and other tests of body control have 
shown that neurotics are notably inferior in this respect. Results 
Shown below in Figure 19 give data on a simple static ataxia test 


` With 120 normal and 1,230 neurotics, in which instructions were 


1 Illustrated in photograph 18. 
hot ustrated in photograp | : 


r d ” 
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si sti xed, with the eyes closed and the 
ly to stand still and relaxed, wi 3 

ЭН hanging down the side, feet together. The test lasted for 

30 seconds only, and maximum sway either forward or backward 

‘was the score of the subject who was being tested. 


N RO- 


22 NORMAŁ GROUP NEUROTIC GROUP 


Figure 19.—Static Ataxia Test of Body Controls N= 1,230 


Т (f) Rorschach Test. —1t is possible to treat the Rorschach test 104 
exactly the same manner as any other objective test, by transform- 
ing responses into ‘scores’, and dealing with these scores statistic- 
ally. Several examples will be given in later chapters; here we will 


quote one investigation to illustrate the procedure adopted. The 
example chosen is a factori 


om our original evidence of neuroticism W25 
based. While this ; to a greater extent than 
; nevertheless these students ha 


European modes of living. 


: c 1 es us to form an opinion ОЛ 
 animportant theoretical point raised by Burt (1915). In his analys! 


_ of personality, he places much emphasis on a trait of ‘genet 
emotionality’, which in the orectic field assumes an importance 
reserved in the cognitive field for the general factor of intelligenc ` 
This factor was first reported in 1915, ina Paper submitted to the 


based on data from 172 normal child: 
hi e =: woe ЗИ 


ү, 


7 


h Association. ‘The analyses, - 
dren and 157 normal adults; . 


“ p» Lg - 
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led to the hypothesis ofa common factor underlying all the primary 
emotions, which was termed “general emotionality” (by analogy 
with “general intelligence”).’ In a recent paper Burt (1950) has 
taken this notion up again, and has contrasted the factor of 
‘emotionality’ with the factor of ‘neuroticism’. ‘A number of 
investigators who have carried out factorial studies on neurotic 
cases have found a similar general factor. Many have inferred that, - 
since this factor is common to all the traits assessed for such groups, оо 
it can therefore be designated a “general factor of neuroticism”. — 

. .. But a general factor for neuroticism would be a factor that is _ 
found among neurotic cases only or among traits peculiar to 
neurotics; and to prove such a conclusion it would be essential to 
examine a control group of normal persons, and demonstrate that 

the factor in question is not found among the normal.’ 

This argument is rather curious, as the general factor of 
neuroticism has always been presented as constituting a continuum _ 
ranging from the most extreme instability to the most marked. 
stability on the other side; to imagine that it should appear only 
among the neurotic would be equivalent to saying that a general o 
factor of intelligence should appear only among the intelligent, and 
that to prove such a conclusion it would be essential to examine 
а control group of dull persons, and demonstrate that the factor 
in question is not found among the dull! However, arguments 
In science are not usually as convincing as experimental demon- 
strations, and Sen's experiment does enable us to reach a con- 
clusion, if only a provisional one, regarding the nature of this factor. 

` Her subjects were given an individual Rorschach test, a verbal 
and a non-verbal group test of intelligence, and Cattell’s test of 
fluency. In addition, each subject was rated independently by two 
Judges for the following traits: Intelligence, verbal ability, imagin- . 
ation, perception of relations, general emotionality, extraversion- 
dep Introversion, assertion-submission, cheerfulness-depression, soci- 
ability, anxiety, neurotic tendencies, and extent of vocational and 
cultural interests. Ratings were on a fifteen-point scale, inter-rater — 
reliabilities ranging from :51 (0570, with an average of +71. Scoring 
of the Rorschach was carried out, using Beck's procedure with two. ~ 
| minor changes. A slight modification of his ‘Z-score’ was intro- 
duced, and Klopfer’s ‘form level’ score was used in addition. — 
Correlations between scoring categories and the mean intel- .- 
ligence test score are relatively slight, only four beirg above the — — 
_ 9 Per cent level of significance for both men and women. These 
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four categories are: Good form (46 and 44 respectively), move- 
ment response (-41 and -42), modified Beck 2 score' (41 and 42), 
and ‘form level’ score (-50 and :59). To obtain evidence on correla- 
tions with non-cognitive ratings, a factor analysis was performed 
on tetrachoric intercorrelations between 36 Rorschach scoring 
categories. Three main factors were extracted, contributing 29, 19; 
and 7 per cent to the total variance. Approximate factor measure- 
ments were calculated for these three factors by reducing the 

original scores to standard measure, and then weighting them with 
the saturations for the three factors. As a last step, in order to 

interpret the factors, these factor measurements were correlated 

with the tests and ratings, which thus form an outside criterion. 

The first factor is one of associative fluency. This is shown by 

. the fact that its highest saturation is for total s 
(+96), and by the fact that its only significant correlations are with 


аз a strongly-marked and importan 
at all, it would appear to be slight 
and through it with extraversion-introversion, 
tively in Dimensions of Personality. But the c 
fluency and emotionality ( :25) is too 1 
importance, and no other signific: 


as suggested tenta- 
orrelation betwee? 
OW to be of any systematic 
ant correlations are reported for 
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general emotionality. Consequently we may conclude that this 
experiment has failed to bring forward any evidence in favour of 
Burt’s claims, and that instead the results fit in extremely well with 
the general scheme of personality description developed here. 


(4) Constitutional Differences 


The type of study reviewed here deals with tests and measures 
which are related not so much to behaviour, but rather to qualities 
of the organism which underlie such behaviour. In the literature, 
this type of work is frequently listed under the general heading 
of ‘physiological psychology’, because constitutional factors are 
obviously closely related to the physiological make-up of the 
person. It is primarily on the European continent that constitu- 
tional research has flourished, and it may be worth-while to 
emphasize the main difference between the physiological and the 
constitutional approaches. In the writer's view, this difference is 
one of emphasis. To the physiological psychologist, reactions are 
segmental, and to be studied as far as possible by excluding any 
facets of the personality not immediately relevant to the ostensible 
problem. То the constitutional psychologist, reactions are not 
segmental, but always integrated with the whole personality, and 
therefore useful in throwing some light on the person who is 
reacting, as well as on the particular neural or muscular mechanism 
which is being studied. To the physiological psychologist, dark 
vision is a segmental phenomenon, to’ be studied in terms of 
physiological variables, vitamin A deficiency, and the like; to the ' 
constitutional psychologist, its interest lies primarily in its high 
correlation with emotional instability and behaviour disorders—a 
Correlation to be discussed below. These two approaches are not 
antithetical, but complementary; the phenomena studied have 
more than one aspect and need to be studied from many different 
points of view. ` 

(а) Body Build (physique).—In Dimensions of Personality it was 
shown that a correlation existed between body build and neuroti- 
Cism. Factor analysis was used in establishing two main factors in 
body build: (1) A general factor of body growth or body size, and 
(2) a type factor distinguishing between growth in length and 
growth in breadth. An index of body build was based on this 
Second factor, ` : А 

LB Stature x 1000 
“Transverse Chest Diameter x 6 
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than. the normals who showed a larger number of mesomorphs 
(index of body build within one + S.D. of 100) and eurymorphs 
(thick-set, broad body build, more than one S.D. below тоо.) It was 
also shown that hysterics tended to be significantly more eurymorph 
than dysthymics, who were markedly leptomorph in body build. 
These results were taken from work on men only. Since then, 
further work has been carried out on women by Rees (1950), 
showing that intercorrelations of body measures among them also 
give rise to two factors; diagrammatic representation of these two 
factors—body size and body type—is given in Figure 20. (An 
index of body type for women was derived from the saturations of 
the various measures for the type factor, giving I.B.,, = :59 Stature 
++ 747 symphysis height — -31 chest circumference — -64 hip cir- ‚ 
cumference.) Again, hysterics were found to be more frequently 
characterized by eurymorph body build, dysthymics by lepto- 
morph body build. And again, the more leptomorph subjects 
tended to be more severely neurotic. It may therefore be regarded ~ 
as established that body build is related to the general neuroticism и 
factor, both in теп and in women. 3 
(b) Dark Vision.—Neurotics have poorer dark vision than e 
normals, as has been demonstrated in several independent studies. - = 
Figure 21 shows the result on the Livingstone Hexagon test ofthe _ 
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scores ofsome 6,000 R.A.F. personnel and 96 neurotics; a score of o 
denotes complete failure in the test situation, while a score of 32 
denotes maximum success. The marked difference between the 
normals and the neurotics springs to the eye (Eysenck, 1947). 


`- (c) Autonomic Imbalance.—The hypothesis that neurotics are 


characterized by imbalance of the autonomic nervous system finds 
its strongest support in a monograph by Wenger (1948), following 
up his earlier work with children. Abstracting only a few of his 
many important findings, we reproduce in Table XXIV C.R.s for 
autonomic tests used by him on 488 normal and 22 5 subjects 


TABLE XXIV 


I III 
II IV 
Жи Factor г. joes 
Item 3 E cod Oferaliona 1| N gna 
rJ ormai . ve 
E Group ча Neurotics 

10. Salivary Output 2-86 25 — 8-13 
12. Salivary pH 2°50 — 48 423 
13. Dermographic Latency 1:00 u7 — :38 
14. Dermographic Persistence +93 II —0ї “Or 
15. Palmar Conductance 4:84 | "19 45 "47 
17. Log Conductance Change 1:06 31 -23 3:36 
19. Volar Conductance 1:43 "19 4 оо 
21. Systolic Blood Pressure 652 | — 37 4°46 
23. Diastolic Blood Pressure e 8-60 30 - 4:60 
28. Heart Period 7:82 *бо “386 6-65 
30. Sublingual Temperature 2:50 | 47 45 2:00 
32. Finger Temperature 3:00 03 =A02 3°81 
44. Tidal Air Mean | 592 — оо 2:47 
46. Tidal Air Sigma 42 15 - 1:58 
48. Oxygen Consumption :Во “13 -35 3°43 
52. Pupillary Diameter “00 == -00 “00 


у him from intercorrelations of the 
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'- C.R.s. The correlations between columns 1 and 2 in the table, and 


between columns т and 3, аге -55 and -29 respectively. Both values 


rise somewhat when subjected to criterion analysis, but even with- | 


out this refinement it is clear that the factors existing within each 


group are essentially similar to, and identical with, the autonomic - 


imbalance factor which causes normals to differ from operational 
fatigue cases, who from our point of view may be regarded as 


‚ falling towards the ‘unstable’ end of the neuroticism factor. 


_ The fourth column of Table XXIV gives C.R.s for a compar- 
ison on the same set of tests between a normal group of 488 
subjects, and a group of 98 neurotics. This set of C.R.s is propor- 
tional to the one given in column 1, and also to the factor saturations 
in columns 2 and 3; the correlations are respectively 551, 42, and 
‘38. These values are sufficiently high to support the general 
hypothesis that autonomic imbalance, operationally defined 
through this set of tests, acts in a unitary manner and distinguishes 
between normals and neurotics. - 

y We have now shown that when we take large groups of persons 
diagnosed as ‘neurotic’ by competent psychiatrists, and compare 
them with large unselected groups of persons who have never come 
under psychiatric supervision, and whom we may for convenience 
label ‘normal’, large differences appear along each of the four 
шаш types of approach we have outlined. In ratings based on 
interviews, large differences are observed with respect to heredity, 
neurotic traits in childhood, shyness and difficulty in making social 
contacts, emotional instability, apprehensiveness, dependence, 
marriage and other sexual difficulties, physical ill health, unstable 
apne record, and former psychiatric illnesses. In self-ratings 
Obtained from questionnaires, large differences are apparent with 
respect to items referring to dizzy turns, palpitations, worrying, 
nervousness, easily hurt feelings, shaking and trembling, irritability, 
nightmares, sleeplessness, shortness of breath, lack оѓ self-confidence 
and feelings of inferiority. On objective behaviour tests, differences 
are marked on suggestibility, manual dexterity, level of aspira- 
tion, motor response disorganization, and body control. In the 

eld of constitutional differences, body build (physique), auto- 
nomic imbalance, and defective dark vision were noted as effective 
differentiating tests. This survey of the literature is far from com- 
plete, but it does depict a mutually consistent pattern of personality 
traits which may be regarded as an approach to the operational 

Cfinition of the personality of the neurotic. It also disproves, at a 


- 


2 * 
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every” high level of confidence, the null. hypothesis; there is very 
little doubt that the group of ‘neurotics’ is differentiated from the 

> - group of ‘normals’ on a basis distinctly different from chance. 
2-0 “We have also been forced to postulate that the ‘neurotic’ group 
Shows a lack of homogeneity which cannot be attributed to the 
general factor of neuroticism, and our hypothesis has been that we 
are dealing with a second factor, orthogonal to the neuroticism 
one, which resembles in many important aspects Jung’s extravert- 
introvert dichotomy. The introverted neurotic shows symptoms 
of anxiety, depression, and irritability; he has overly high levels 
_of aspiration, overly high judgment discrepancies 
response disorganization of the ‘overactive’ type. 
of the leptomorph body build. This general sy 
called ‘dysthymic’, to avoid the faulty associatio 
term ‘psychasthenic’ (Janet, Jung) would call up. The extraverted 
neurotic is characterized by „hysterical conversion symptoms; he 
has unduly low levels of aspiration, unduly low judgment dis- 
crepancies, is subject to response disorganization of the "inert 
type, and tends to be of the eurymorph type of body build. This 
general syndrome is identified with the ‘hysteric’ classification. 
Figure 22 gives a schematic picture of the relation of these two 
factors (neuroticism and extraversion- 
ally defined by our tests and measures 
In this volume we are concer: 
only; extraversion-introversion 
“Dimensions of Personality, 
nection with it will be re 
tioned here for two reasons: (т) 
- ш соппесііоп with level of aspirati Y i 
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‚ is subject to 
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ndrome we have 


„„'° May conclude this brief survey by noting that all the 
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nce quoted is strongly in support of our heuristic hypothesis 
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Chapter Four 


THE NEUROTIC DIMENSION: OBJECTIVE 
MEASUREMENT 


qualitative and quantitative aspects of a problem tend to inter- 

act and that advances in the qualitative analysis of a complex 
field tend to be followed by improvements in quantitative measure- 
ment, which in turn lead to advances in qualitative analysis. 
Having in the previous chapter established the existence of a 
dimension of neuroticism and defined it operationally, we must 
now go on to discuss the possibilities of an exact quantitative 


measurement, and the reliability and validity of such measure- 
ment. 


It is clear from the general approach adopted in this book that 
what we are concerned with is the ‘internal’ validity, i.e. the 
multiple correlation between a set of tests defining the factor of 
neuroticism with that factor. In addition, of course, we have an 
‘external’ criterion, namely, the diagnostic distinction between 
normal and neurotic groups. The internal criterion, as has been 
pointed out in the second chapter, is defined partly in terms of the 
external by maximizing the correlation through rotation of the 
former; the external criterion, in turn, is modified in terms of the 
internal by selecting those clinical sub-groups in the general field 


of neurosis which maximize the correlation between factor and 
external criterion. In order to sh 


interact, we will, in this chapter, 
out with particular reference t 
problems which has not hitherto 


[: is a well-known principle in scientific procedure that the 


£ than six months’ se 
far as possible), who had clean disciplinary records, and who, on 
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the Matrices test of intelligence, had service grades other than 4 
ог 5. (This eliminates all those with I.Q.s of go or below.)! 

The ‘neurotic’ group consisted of soldiers who were being dis- 
charged from the Army on neuropsychiatric grounds. Each of 
these had been seen by one of four psychiatrists, who diagnosed 
them under the three headings of hysteria, anxiety state, or psycho- 
pathy. No psychotics were tested and the psychiatrists agreed not 
to send patients with service grades of 4 or 5 unless they felt 
on clinical grounds that the patients were more intelligent than 
indicated by their test records.? 

Certain deviations from the original plan, as outlined above, 
took place which must be noted. Regarding the controls, it was 
found impossible to obtain a large enough sample not contain- 
ing re-enlistees, i.e. persons who rejoined the army after yarying 
Periods in civilian life. About 50 per cent of the normal subjects 
actually were rc-enlistees, and if Ње assumption is justified that 
persons who rejoin the army in this fashion may be regarded as 
having failed in their adaptation to civilian life, and are therefore 
less stable and less well adapted than the average, then we must 
conclude that our normal group was slightly biased in the neurotic 
direction. Regarding the neurotic group, it was found on analysing 
the data later that the criterion requiring those who took part in 
the experiment to have selection grades of 3 or upwards was not 
fully followed. This led to there being a marked difference between 
controls and neurotics with respect to intelligence, the neurotics 
being very significantly inferior. (The exact data will be given later.) 


1 These men were sent from the Woolwich Garrison, and our thanks are 
due to the Commandant of the Royal Artillery Depot for having given per- 
mission for these men to be tested and for his interest and help in the project. 
The main work in connection with the administration of the project was carried 
out by Dr. S. Crown. Testing was carried out by Drs. A. Clarke (1950) and 
A Gravely (1950), whose theses contain a detailed account of their work, by 
J. Standen, Dr. A. Lubin (1951), and G. Yukviss. To all these the wrifer is 
indebted for their skilled and competent contribution. In the description of the 
Sroups tested and the general set-up of the experiment the account given by 
Clarke (1950) has been followed very closely, and often verbatim. The number 
of cases used in Clarke’s and Gravely’s theses is smaller than that used in this 
chapter as the research went on beyond the point when these two. started 
Wórking up their results. In consequence some tests attained levels of signifi- 
cance they fell short of with the smaller number of subjects. о 1 

2 Our thanks are due to Colonel Pozner for his invaluable help in making 
these arrangements and carrying them through, as well as to the other psychia- 
ttists who took part in the selection procedure. 
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Another important reservation attaches to the diagnoses, par- 
ticularly those of hysteria and anxiety states. In the many doubtful 

~ cases, one important variable factor is the personality bias of the 
interviewing psychiatrist. When there is some doubt as to the 
actual diagnosis, some psychiatrists will lab 


el a patient an hysteric, 
who in other hands might be called an 


anxiety state, and vice 
o made the diagnoses, 
es; thus, A had a strong 
ght tendency to diagnose 
the cases came within the 
action to stress, but had a bias for 
D when confronted with a doubtful 
lety state. The actual tabulation 
ur psychiatrists, strongly bore out 
uently it is very doubtful whether 
en hysteria and anxiety states can 
.be taken very seriously. We will give test results separately for 
reader may be able to judge for 
do obtain between the different 


tendency to diagnose hysteria, B had a sli 
hysteria, C thought that a good many of 
normal limits of emotional re 


d out by members 
artment and by severa] Ph.D. 
almost continuously for two full 
cals and rest pauses. The tests used 
© next section, together with the main 


test for the various groups. Also given 
tion ratio between 
controls, 


students, Each subject was tested 
days, with suitable breaks for m 
_ will be briefly described in th 


ty state. 
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carry out the testing in such a way that every subject completed 

every test. The cause lay partly in the breakdown of apparatus, е 3 
the breakdown of the time schedule, late arrival of transport, and ` 
similar causes not dependent on the personality of the subject, and 
therefore not likely to have contributed any bias to the testing — 
procedure. Together with the data relevant to each test are given, — ~ 4 
therefore, the number of cases in each of the groups involved. The +- 
first two variables: (1) physical age in years on day of testing, andas 


«2 


(2) service grade on matrix test of intelligence, will show the way - 


in which the data are set out. Ду 
NI Е 1 c : "от Biserials 
G) Age | Normas | Ана» | кушшщ | Рон | a, ен.) шт x 
States | | paths | Ratio Eg н.| Hv 217” 


No. of Cases 207 47 48 | 302 95 
сап Score | 25:54 | 23:36 | агча | ЗЭ” RIA 
Variance 36:01 | 31:37 | 23:00 <i 
- LÁ ^ 
| Point Biserials _ 


+ 
T > | Correla- 
(2) Intelligence | Normals | АЛЧУ | piysteries | Pocho- | A, ен. tion юу 
States | Ratio A: g H. 


No. of Cases 


05 38 40 24 78 +506 283 | 78 


° 
Mean Score 2 8 | A й Я 

Б 2:84 “10 18 42 | 41 —48! оло : 
Variance 0-88 To E то |, iu | об к; ар 
Eie | | | le 


ж 


Тһезе figures show that, contrary to our intention, the normal = 
Sroup is differentiated from the neurotic group by being slightly ~ 
‚ Older (a point which does not greatly matter as the difference is 3 
extremely slight, although statistically significant), and also a good 
deal more intelligent, as indicated by the higher score on the 
Matrices test. This failure to equate the two groups for intelligence _ 
has been pointed out before and the way in which it affects the 
results will be discussed later. It is possible that the matrix scores 
4 Which determine the S.G. grading of the soldiers may not have 
peen as valid a test as might be supposed, because in another test 
of intellectual capacity, the Mill Hill Vocabulary test, the differ- - 
ence between the normals and the neurotics, while still significant, 
15 considerably reduced. The figures for test 3, the Mill Hill 
Vocabulary, are given below. Е 


1 _ 1 A comparison of the ratio of verbal to non-verbal intelligence test results 
p anxiety states and hysterics confirms our previous finding that the ratio is - 
ow for hysterics (Eysenck, 1947; Himmelweit, 1945). 


g^ о» y › 
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"p | Point Biserials 
(8) Vocabulary | Normals | Anxiety | нудайх | Pocho- | 4 ep py E v > 
States paths Ratio |" es | Hiv. А. 
No. of Cases | 205 45 | 47 25 2 1313 | °97 92 
Mean Score | 15:56 12°99 | 11:12 | 11:04 | 12:03 te Nag. -'154 
Variance 24:24 | 39:04 | 35:00 | 46-13 | 37:46 | | 
E | ! — ING 


We are now ready to discuss the differences observed with 
respect to the various tests making up the battery proper. The first 
group of tests to be discussed is made up of personality inventories, 
questionnaires, and other written tests of the type described in the 
previous chapter. They are (4) the Crown Word Connection List: 


(4) W Corel Point Biserials 

д) Word Anxiet ics | Psycho- oreas 

ti Normals | 5 9 | Hysteric у 14. ёН. tion A 

"Lit n orma’s | States ЭЗ1661 paths |4 Ratio vA H| Hv As 
— Я і 225 | E — 
No. of Cases | 189 45 45 25 90 :316 297 90 
Mean Score | 12-36 15°51 | 17:38 | 16-76 | 16-44 = | —:295 139 
Variance 


48:94 | 4097 | 53:69 | 45:33 


The Maudsley Medical Questionnaire: 


+ 


Point Biserials 


аха о. 5 LI Correla- 
(5) M-M.Q. | Normals | Ansiety | Hori, рим |4.€ H. бот X UN m 
a р E ч 
No.of Cases | 195 41 4 25 85 -596 280 85 
s Mean Score 10:30 | 23:24 | 20:44 10:04 | 21-79 | == —':596 | —:169 
ariance 42:87 | 52:50 83:57 | 67-92 69:76 


The Minnesota Multiphasic Personality Inventory, Hysteria 


Scale: 
К Correli Point Biserials 
(б) MMPI. | Normals | Anxiety | ну, DEUS. ае Ориг 
Ratio ae H.| HvA 
No. of Cases | ооо 45 44 25 89 451 8 8 
Mean Score 1342 | 2042 | 18-06 | 18:32 1965 | == E os - do 
Variance t 26:06 | 42-75 49:38 | 48-89 | 46-12 
= 

‘The Minnesota Multiphasic Personality Inventory; O Score 

(this 18 the score on those items on the hysteria scale in which 


Supposed to answer in the s 


o 


ame direction as normals): 


| 


14. Maudsley adaptation of choice-behaviour unit. 


15. Luria apparatus, with 


automatic visual stimulus presentation. 


16. Dynamometer persistence test. 


D 
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0) MMPI. 5 ique легап 
"m Anxiety га | Psycho- i] 

Su атан | “States | Hotes | Бр А ӨН.) fion | TM 
No. of Cases | ооо 45 44 25 89 :237 ^ 289 89 
Mean Score | 658 | 487; | 518 | 508| 5:02 | == | 227 +046 
Variance 12:44 | 12:39 | 11:46 | 11-83 | 11:82 | 


The Minnesota Multiphasic Personality Inventory; Lie- 


Score: 
| а I "m Point Biserials 
| 8) Lie $ Anxiety | guae, | Рф an 
| (8) Lie Score | Normals ‘States | Hysterics paths A. & H. уу Я Mer | Hv 
No. of Cases | ооо 8 ` 
45 44 25 9 105 
Mean Score | 3:06 | 2-80 3°39 | 2:56 | 3:09 | 
Variance 3:96 3:48 6-24 | 3:26 | 4°88 | 


Annoyances (total number of annoyances picked from a list 
prepared especially for this battery and obtainable from the 
Institute of Psychiatry, Psychology Department): 


сй Point Biserials 
( А Акай " Py" отт! a= 
9) Annoyances | Normals mony Hysterics balks A. & H. к. УМ y| Hv. A 
————— 
No. of Cases го Ч 

5 |47 | 47 | 24 | 9£ | 36: | 299 | 94 
ven Score | 2282| 2681| 26'38| а433| 26-60 md e] 

ariance 100:85 | 149:29 | 105:89 | тогг10 | 126:26 


Social Attitudes; Emphasis Score (this is the number of items 
9n which the individual agrees strongly or disagrees strongly; the 
mventory of social attitudes used has been published elsewhere 

Eysenck, 19474) ): 


Point Biserials 


бене | ua 
(10) Emphasi Anxiet jes | Psycho & H.| tion 
phasis | Normals | {ШУ | Hyserics | "умы | Ae mato |N es | Нч. А. 


No. of Cases 189 40 42 19 82 -148 
сап Score | 1539 | 1828| 1771| 1995| 1799 А 
агіапсе 89:40 | 129:02 | 108-84 | 118-05 | 117:30 


Worries (this is a Pressey X-O type of test especially made up 


8.8.Р, к 
» 


“ 


2 


“ " = ” * 
E 
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for this experiment, copies of which may be had from the Psy- | 
chology Department, Institute of Psychiatry): 


Ka | 


(11) Worries | Normals | “ШУ | Hysteries | Podo- | 4, & H. 
=— 
No. of Cases 170 | 44 43 21 87 
Mean Scorc 1755| 2634| 2463| 22°86| 25-49 
Variance 14928 226-37 | 203:38 | 239:83 | 213:25 


Number of Things Disliked (this is a Pressey X-O type of test, 
the score on which is the number of things disliked. It was especially 
made up for this experiment, and copies may be had from the |) 
Psychology Department, Institute of Psychiatry): E 


ж 3 | Cort Point Biserials 
“| (12) Dislikes | Normals | p | Hysteggs | Tacho- Г OH.) Же Devo ul mea 
—' € її = E 
| | | 
No. of Cases | 177 | 44 | 39 23 85 “заа 260 83 
Mean Score | 13:11 18:73 | 21-72] 1717| 20°13| “== | —:323 116 
Variance 57:53 | 14397 | 19473 | 112779 | 167-99 


. Number of Things Liked (this is a Pressey X-O type of test, the 
Score on. which is the number of things liked. It was especially 
made up for this experiment, and copies may be had from 
the Psychology Department, Institute of Psychiatry): 


Psycho- 


(13) Likes | Normals АЧЧУ | py sterics A. & H. | t = 
2 | | States 58 paths | Ratio v gr. | Hove 
41 | | p i B E 
No. of Cases 184 44 41 25 8 | 071 26 8 
Mean Score | 41:52 | 41550 | 40:44 38-00 | А. 18 Ten oe 037 
Variance | 231°23 | 207-98 | 201-95 261:33 202-92 | 


Discussion of Questionnaire Variables.—1t will be seen that nearly - 
all the questionnaires give a highly significant differentiation 
between the normal and the neurotic groups. The highest ав- 
crimination is given Бу the Maudsley Medical Questionnaire, - 
which has a correlation ratio of -60. One exception to this general 
conclusion 48 the emphasis score, which is insignificant, although 
all the three neurotic groups differ in the expected direction 
(greater emphasis) from the normal group. The other exception is 
of much greater interest and importance; item 8, the Lie Score, 


g в - 
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shows no differentiation between the normal and neurotic groups. 
This finding is of particular importance because the Lie Scale was 
constructed in order to determine whether or not a given subject 
completed the test in such a way as to try and show himself in a 
particularly good light. Such a tendency would be brought out 
very clearly by a high figure on the Lie Score, as has been shown 
experimentally by various investigators. Our finding that normals 
' and neurotics do not differ with respect to the Lie Score would 
appear to go counter to claims often made that the normal group 
would be trying to make themselves appear in a good light, 
whereas the neurotic group would not, and that any differences on 
questionnaires such as the Maudsley Medical Questionnaire would 
be due to this tendency. No such interpretation is possible on the 
basis of our data. We may conclude that both neurotics and 
normals tend to give answers to the best of their ability and that 


these answers discriminate at a very high level of significance” 


between the two groups. ? 
It may be of interest to compare the scores of our groups with 
other normal and neurotic groups tested previously to determirie 
whether they can be regarded as in any way representative. The 
two tests on which most standardization data are available are the 
Word Connection List and the Maudsley Medical Questionnaire. 
The average score for a normal group on the Word Connection 
List is about 10; for neurotic groups it is in the neighbourhood of 
16. Our data would suggest that whilé the neurotic group is 
similar to those tested previously, the normal group is somewhat 
less stable than other groups tested, a conclusion well in line with 
the fact that this group included a large number ofarmy re-enlistees, 
as pointed out above. With respect to the Maudsley Medical 
Questionnaire, the scores for the normal and neurotic groups 
respectively are very close to those of previous samples tested at 
Various times. 'The scores given here may be-compared with those 
quoted in an earlier chapter. On the whole, it seems likely that the 
Eu ups tested in this research are fairly typical of normal and 

€urotic groups respectively. LESE 
The next CAM Es tests i be discussed is made up of objective 
behaviour tests. Four of these, numbers 14, 15, 16, and 17, are 
Manual dexterity tests, M, №, О, and P, taken from, thé United 
States Employment Service General Aptitude Battery. These tests 
are illustrated in several of the photographs in this book (numbers 
19, 20, 23), and will therefore not be described in detail. The 


° 
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results obtained from them are given below, as well as a combined 
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total score, number 181 
| Ї EUN. 3 
| = | Point Biserials 
— om : Я | Correla- — 
M IEEE | ны T AH] er Уем. TM 
| L | Ratio А.Фн| "“^ 
= | == | > — A 
No. of Cases 168 | 3g 37 28 76 | -gi 2 76 
деа Score 7720 71:59 7216 7139 | 7:87 35 80 '033 
auance — | 6072 | 5751 | 96:97 | 74:34 | 55:77 
— ы ! — 
E 2 22-22 po — 
| E уй 
й ls 
(15) Desterity ) ж к " | бин Point Biserial : 
Test No” | Normals Anxiety Hystries | Poe aeg. din ЖЕ к 4 
| ti „у. | Hive de 
Í or ASH. 
No.ofCases 168 | | 
э 37 | 88 | 56 | зав | ай 76 
DA Score 8806 76315 7497| 7335 | 75-58) 22 зог | 94 
с | 101-09 162-50 178-30 | 135°06 168:27 
= - - ——— — ——ÓÀ 
s PT — E -— 
Gp Derigrity | .. : Point Biserials _ 
‘rest G7 | Normals | piety | riter | Parcs а, акы. | Се | —— 4 
Ratio |N ves р) Hive 4. 
— =| amem vm. | * ё —— 
Мо, of Cases | 168 | E "d 
Жы ни, 39 37 23 76 | ca; 2 76 
Маш есе хохо 24°18 | 23:00 | 24/22 23:60 | == 266 4h 
20'1 16:68 | 18-78 | 11-63 17:82 | 
SUE. el ши. 
o 
(17) Dexterity ака, Л, n ns Pide: | nas. Point Biserials — 
Test P States iysterics paths | A. @ H, ton ӨШ OP 4 
| Ratio № H. v 4 
ы | ^ oe Ji CNET A. & H. 
No. of Cases 168 39 PEL pres ie 
Mean Score 25:56 | 22:77 ЗУ 2935 244 16 
Variance | 10:68 | 15:29 | 22-30 12-54 18:44 324 '030 
E 2. d - 
(18) Dexterity 5 Eus 3 1104 
Tes Total | Normals | Bi 1 ЭГРт pore х ad a 
| Ratio |N. v. 
- Ce | atio у. H. v. A 
| | CERO | А. 
No. of Gases | 168 | 39 37 23 76 y pu 
Mean Score | 212-13 | 194-69 | 193716 | 19196 193:95 76 
Variance | 350-71 | 556-64 | 72975 | 451-41 | 63245 | 75534 


1 It should be noted that in order to saye timi 
tered in exact*conformity with the official inst, 
therefore, not be used in comparing our groups with Sth esults given should, 
of the official instructions. The main difference ЇЕЫН ers tested on the basis 
test trials, and in different wording of instructions, n the Curtailment of pre- 


© these tes 


3 ts we inis- 
ructions, re not adminis: 


Test 19 is the Leg Persistence test, scored in terms or the 
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number of seconds during which the leg is held out over a chair 
without touching down (illustrated in photograph 17). 


| 27 Point Biserials 
ч orrela- 
(то) Persistence | Normals | “Anstey | Mysteries | БӨӨР | A. ан.) ti 
a | | 
No. of Cases | 172 38 37 | 23 75 | 
Mean Score | 6017| 4895| 4865| 52°61 48-80. 
Variance 1612-00 


1184-20 |1036:70 


983-80 |1302-60 
| 


Tests 2 


o and a1 are Speed of Tapping tests, i 


n which the sub- 


ject is required to tap on a piece of paper with the tip of his pencil 


for 15 seconds. 


| Point Biserials 


, Anxiety „| Psycho- Carrela- | 
(20) Tapping I | Normals | “Stays | Hysteries | “paths A. & H. im | N Yg ” | mnm 
No. of Cases 167 39 37 29 76 *055 
Mean Score | 69:76 | 7026| 71°89, 6783| 7r05 ! 
Variance | 365°00 | 413710 | 254/70 381-40 | 332:20 | 
"EN DEM 3 — = pee ши a 
Em У] PME TT 

had Anxiety | „| Psych r rela» 
(21) Tapping II | Normals | “States | Hysteries | “paths (4.8 Н. dm AMNEM 
No. of Cases | 168 39 | 37 23 76 7071 
Mean Score | 78:38| 73:59 7541| 7043 | 7447 
Variance 332-00 | 297:30 | 181-10 | 349:80 | 238-40 


Items 22 and 


Anxiety 


23 relate to the amount o 
tapping with a pencil on a sheet of paper. 


Psycho- | д. & H. 


(22) Scatter I | Normals | “States Hysterics | ` paths 
No. of Cases | 172 39 | 37 23 76 2193 248 wo 
Mean Score | 5848| 9333 6784| 4913| 8092 —'154 | 1175 
Variance |4073:00 6338-60 421740 3490'10 (5400:50 

буйд Point Biserials 
Я | rrela- | ү — 
(23) Scatter II | Normals red Hysteries | P. Ped |а ен jon х, pan p 
' No. of Cases | 172 39 37 23 | 76 235 | 248° 76 

Mean Score | 54:36| 10r28| 69:73, 4301. 85:92 —`196 | —187 
Variance 4575/60 8558-80 537490 600-70 7168-50 
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Item 24 is the average scatter on both trials. Scatter is measure У 
in terms of the length of the perimeter drawn around the dots in 
such a way that only convex and no concave lines are admitted. 


X94 


4 Р | сена: 
0 Mean Normals uma Hysteries Todos |а. & H.| d 
= | | ЕЯ, 
No. of Cases | 172 | 39 | 37 23 | 76 | -217 248 7528 
Mean Score | 56:74| 97-69! 68-92 | 46°96) 83-68) == -55128 | == 
Variance 4164-80 7234-00 4593:20 3458-50 6079-60 | | 
| 42:21 pesi Ж Р”. 


Numbers 25, 26, and 27 are scores derived from a Speed of 


Decision test, in which two cards are placed in front of the subject; 


face down, and he is asked to say which of the two will be the 
higher. If he is right he obtains one 


one point; if he says he feels certa 


all 1o tests, and score 27 is the number 


of certainties. (See photograph no. 21.) 
25) Speed of ie b s le 
“ний РЭН" ш | ушш | Ров | 4, gr omis 
~S Ratio 
No. of Cases 172 39 37 23 | 56 о, 
Mean Score 3:38 3°46 3°54 | 3:09 3:50 45 | 
Уагїапсе 5:59 504 12:81 2:90 8-71 . 
! 
(886) Speed of Anxiety. БЯ | E: ^r] бйз. 
malame шилж Рова |а y Ёс) 
= каз... 2) Е.Б MM - 
No. of Cases | 172 39 37 23 76 -063 
Mean Score | 21-24) 21-26 18:02 | 19-61 20:12 
Variance | 21345 | 257:56 | 142-80 | 107-0 200-43 
zE a 
5 - "7 Point Biserials 
[D | noma | er | sti | “Ай |, yy, | Corde or 
certainties Ratio | N. pa g| Hv A. 
El. ras LA —À , 
No. of Cases | 172 39 А 23 76 “148 
Mean Score | 8-30 797 7:05 6096 | 78. 
Variance 56-81 6-92 | 729 | из 703 s 
s : ? | 
Scores 


28 to 31 are derived from th 
which the subject is required to trace ар 


€ Track Tracer test, in 
ath between two rows of 
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holes with a metal stylus; if he touches one of the holes, a buzzer 
rings and an error is counted on an electric counter (illustrated in 
photograph 12). This test is given as a Level of Aspiration test, in 
the way described in Dimensions of Personality (1947), and the four 
scores given are 28—average goal discrepancy for the time scores; 
29—average judgment discrepancy for the time scores; 30— 
average error score; and 31 longest time performance score. 


Point Biserials 
(28) Goal B Correla- 
H ? Anxiety * Psycho- " 
Dares Normals | “Stites | Hysteries | нийг A. & H. LE WIR zm 
A. & H: 
No. of Cases 172 38 37 22 75 148 
Mean Score | 1:23 213 | 216 | 559 | 215 
Variance 6:02 | 30°66 | 10:31 | 730 | 20334 
IP | Point Biserials 
(29) Judgment | Ad Ioa Correla- cd 
Discrepancy: Normals | “States | Hysterics paths A. & н. ae м, т 
1| 2 2 
No. of Cases 172 38 37 22 75 | 134 
Mean Score | —:59 | 16 451 | —:28 "33 
Уагіапсе | 527 | 2635 | 1415 | 714 | 20:09 
| | | 
| | P Point Biserials 
(зо) Mean " Anxiety «| Psycho- еше сэ» 
Е, Normals | Hysterics A. € H.| tion 
Error Score States paths ^ | Ratio BE hes y| На, 
No. of Cases | 172 | 38 | 37 | 22 73 | 167 
Mean Score | 100747 | 112737 | 11108 | 108-64 | 111/73. 
Variance 920:30 | 807:80 1276:60 840-00 (1025:30 
«T 10 Палас рў бег Point Biserials 
31) Longest P | Anxiety TX Psycho- | 4 5 F] 
Time Seore | Normals | “Staves” | Нунез! pans | 4 & B Hone | Ne нэл 
| A, & H. 
| 
| 
No. of Cases | 172 | 38 | 37 | 22 75 161 
Mean Score | 38-05 | 42-79 | 42:24| 40:96) 42:52 
Уагіапсе 151-20 | 104-17 | 238:02 | 11576 167:96 


Scores 32, 33, 34 and 35 are derived from the Necker Cube 
Perceptual Reversal test, given the following four ways. First of 
all, the number of reversals during passive contemplation for 
30 seconds (score 32); second, the number of reversals. while trying 
to maximize the number of reversals (score 33); third, the number 
of reversals while trying to minimize the number of reversals (score 
34); and lastly, the number of reversals while regarding the Cube 


° 
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passively, without trying to maximize or minimize the score 


(score 35). 
| Point Biserials 
(32) Cube ? Жылу | | Pryclo- Соте1а- 
Био Normals | “grey | Hysteries | “иг | Ae ен. es NX Yyy) Hove ds 
| 
No. of Cases | 178 41 44 25 85 "110 
Mean Score 6-16 5°78 | 5:09 6-88 5:42 
Variance 21:65 | 1438 | 9:29 23:19 | 11:72 | 
| Em 
| Point Biserials 
(33) Cube 1 Anxiety | E usd Бий Corl- | 7  __ „= 
Au Normals | де? | чете | Ж” |л.@н. Ж. гал HOS 
А.Н. 
No. of Cases | 178 41 44 25 85 077 
Mean Score 11:93 | 10°76 | 12:32 | 13:16 11:56 
Variance 51:33 | 49:34 | 10022 | 101-07 | 75:42 
Point Biserials 
(34) Cube : аша | E Correla- 
Aen. Normals | “mee | Hysterics Bus |4 ен. Jn. бз H. v. 4- 
| A. & H. E 
No. of Cases | 178 41 44 25 85 ES 
Mean Score | 3-67 366 | 3:36 | 3:20 3°51 s 
Variance 6:58 13:48 | 10:38 | 2:58 11:73 
3 ji iserials 
(35) Cube Anxi ; Сән Р PE 
1— | Normats | Anxiety | нушид | Podes | | rrela. йк 
versal тта | “States | Hysteries | Бе | A. pr Ка Г med 
No. of Cases 178 I 44. 25 8 E 
Mean Score 6:49 6:39 5:64 6:64. 695 д 
Variance 19:83 | 28-54 | 13:07 29:66 20:45 | 
Scores 36 and 37 soper from attempts to measure the 
amount of pressure exerted on a Pencil while doing the Mirror 
Drawing test (score 36), and while do 


pressure left a mark. 


of pressure was measure 
apers through which the 


" Anxiel; 2 Psych Point Biserials 
$28) Point | Normals | Ansiely | нушайх US. | ae = _ Я 
—b» o» Dan, EvA 
No. of Cases | 187 42 42 93-11 8/ Эс Exp sd 
Mean Score | 7-37 | 767 | 844 | 5-59 790 | 228 271 84 
Variance 2:62 gor 2:03 2:36 E 
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Point Biserials = 


| | | | 
(87) Point | дым, | Anxiety | pteris | Роб | a н. ша [— 
Pressure, IT | ы | States | ш шь» үер дав We Ti H. v. A. 
Wk | Ба ! Е | 
No. of Cases | 187 | 42 42 | 23 84 +167 271 84 
Mean Score | 743 | 740 | 7°69 | 674 | 755 - | —"132 | 2095 
Variance 1:98 181 2:80 2:47 | 2:30 | 


Score 38 is the body sway in a Static Ataxia test scored in 
1" units; score 39 is the body sway in the Suggestibility test 
(illustrated in photograph 8). 


| | 2. Point Biserials 
8) Stati " шу | x ych ый 
ЗУНЫ | Normals | {ШӘ | ваа | “ИДС 14:98 Жш Куу Т a 
| т EM . 5 
| _| З, 
| 
No. of Cases | 177 42 46 | 25 88 1259 265 88 
Mean Score | 563 | 8оу | 6:89 | 720*| 746 | 7 | —255 | —138 
Variance 6:98 | 26-85 | 10-41 | 26:42 | 18:39 
| | [ii | Point Biserials 
(39) Body 1987 А Correla- 
Sway Normals | “ЭМӨУ | Hysteries | PIHO- | 4, ён. С 
Suggestibility 1 States | Ут | paths Ratio | ep gp | Hove As 
ыг - а ——- 
No. of Cases | 178 42 47 25 8p | 1161 267 89 
Mean Score | 1463| 2800| 2r70| 22:68| 2232| ^ | —'159 | —'023 
Variance 37567 | 839707 | 750756 | 845:31 | 78370 


Scores 40 and 41 are in terms of answers to questions put at 
the end of the Static Ataxia and Suggestibility tests respectively, 
as to whether or not the subject had felt a tendency to sway or fall. 


Score = 2 points for ‘Yes’. Answer, 1 point for ‘No’. ^ 
| | | pu Point Biserials 
(40) Ataxi & Anxiety „| Psycho- | “эче. eee X 
Gigs | жеш | ий? Намба | "ИЙГ [AE] Да, xe va. 
| | | 2 
No. of Cases | 176 42 | 47 25 8g | :276 265 89 
Mean Score Eos rg: | r66 1:56 г68 | === | —-283 | —-058 
ariance -24 "21 23 26 eum 


1 Measurement of sway in these two tests was carried out byemeans of 
a device described in detail elsewhere (Furneaux, 1951) and illustrated in 


photograph 7. 
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| Л | (Ж 2 | “МЯ Ваа 
1- 5 ! = rela- = 
Gh) Sgen Normals | АНУ | Hysterics | айн | 4 eo gr tion ета = 
Question d | р | Ratio |. эн. | НА. 
| | 

No. of Cases 176 | 42 | 47 | 25 | 89 138 
Меап Ѕсоге r54 1:74 1:57 1-60 1:65 
Variance 25 :20 | 2 -25 E 


Tests 42 and 43 were done with the use of the Luria test, score 
42 being the number of verbal failures in the Association test, 


which constitutes a stimulus for the Т, 
amount of right-hand motor distur 
graph (illustrated in photograph 15). 


uria, and score 43 being the 
bance recorded on the kymo- 


| | pe Point Biserials 
(42) Verbal |v. nals | Anxiety Hysteries | Psycho- | д gp py | Ogea- | —— 
Failures ermats | States | Tosteries | pones Ld V Nes н. | лой 
| lox qp 
No. of Gases | 112 33 | 38 | 22 | 7 197 183 71 
Mean Ѕсогс | 874 | 533 | бза | 4:09 | 58у | 77 —'195 | 2089 
Variance 25:06 | 31:48 | 33:74 | 2285 | 32:48 
A int Biserials 
(43) Right- | LoT |ы. НЭЭ бш | FAm ane 
шин MOIS Normals | "esie | Posteries | “даг | Ae & H. бт x "p 
| | Г 
«| 
No. of Cases 99 | 30 d 31 18 61 61 
Mean Score | 2516 | 46-50) 48-84 42°78 | 47:69 s | -048 
Variance 33484 | 652-47 | 560-34 659-24 596-92 | 
| 


Scores 44 and 45 were derived from 
tions test, in which the subject is asked t 
on which are painted a number of colo 


the Kretschmer Abstrac- 
О look at a stimulus card 


З : ured letters, He is told to 
memorize the letters and their exact posi 


after the cards have been withdrawn. H 
asked what the colours of the different 1 
number of colours remembered, 45 the nu 


Anxiety 


itions and is tested on that 
€ is then, without warning, 

ctters were. 44 is the 
mber of letters. 


| т) Psyche С; 2 Point Biserials 
сыы E | Normals цер | Hysteries pathe Ан, jg 
| atio |N. у. sil 
2 | eel ВЕ аен v 
No. of Cases | 172 39 | 87 | «оз 76 126 — а 
Mean Score | 1-37 1:20 :89 1:17 гоз 
Variance 1:68 1:64 1:43 1:79 154 
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(45) Letter 


| 
| 4. € н. 


600г | Normals | iY | Hysteric | Росо 
| 
No. of Cases | 172 39 | 37 | 23 76 
Mean Score | 15:51 | 12:72 | 13:16 | 13°65 | 12:03 
Variance 16:70 18:89 1914 | 18:2 18:81 


"Tests 46, 47, and 48 are parts of a Concentration test in which 
a long series of numbers or letters is read out to the subject at the 
rate of one per second. At irregular intervals the series is suddenly 
interrupted and the subject is asked to reproduce the last six 
numbers of letters read out to him. Scoring is in terms of the 
numbers or letters repeated correctly in the right order. Score 46 
is the score on the numbers test, 47 on the letters test, and 48 on 
the numbers and letters combined. 


| | | Point Biserials 
6) Concen- i 5 | la- 
Uanons | Normals | Аш | нуш | Pocho- | 4, агн.) y deer 
numbers | | Ratio | es py, |^ Hive А. 
No. of Cases | 170 39 | 37 23 76 | гоо 246 76 
Mean Score | 29:24 | 20067 | 26:89 | 2692 2678 | —— 183 017 
Variance 3434 | 41:49 | 51:54 | 4927 | 4578 | 
EM e —- 21. ee i L.S 
(3 = 
| | | Point Biscrials 
(47) Concen- 5 | Correla- 
io: Р Anxiety | | Psych РР “ш 
tration: Normals e | Hosteries | MA Ё SH.) йш [уу me 
З. Il сы ЗОМИН | Ёс 5 
| 
No. of Cases | 170 39 37 23 76 Ч 246 |. 76 
е 25:86 | 22:56 | 23°03 | 2130 | 2279 | = 231 033 
ance | 30-45 | 5015 | 4980 | 58:31 | 49°37 | 
TO eae | e | | т | int Biserials 
ati : fo “Апх ^sycho- 1 
tration: | Normals | Sates, | Hysteries | paths | Hv. 
| | 
2 d | 5 | 
EG: | | | 
No. of Cases 170 | | 6 259 | 246 76 
39 | 37 | 23 791 see | } 
еп Score 554:| 4923| 4992| 47:52 | 49:57 | | 930 | 028 
ariance 103:42 | 15976 146:58 | 178-08 | 15142 | | 


Test 49 is a Dark Vision test. The instrument used was an 
A.R.L. Adaptometer Mk. та, as produced at the Admiralty 
Research Laboratory. The instrument is illustrated im photo- 
graph g. It consists of a rectangular box with a lamp at one end 
and a circular diffusing glass viewing screen at the other. The 
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brightness of the screen can be varied by the insertion of sixteen 
graded apertures at a point about half-way between the lamp and 
the screen. The Adaptometer contains a movable filter holder 
which enables the user to insert any one of three filters. Thus a 
very wide range of illumination is at the experimenter’s disposal, 
from 3:961 x 1077 to 7947 X 10-? foot candles. Immediately 
in front of the viewing screen is a metal plate cut into the form 
of a sector which can be rotated into eight separate positions, 
being determined by a clicker mechanism. These positions may 
be described in hours as on a clock, i.c. 12 o'clock, 1.30, 3 o'clock, 
and so on, and the subject is asked to report on the position of the 
sector at each point of illumination of the screen. The apparatus 
operates on a 6-volt 40 ampere-hour battery and a rheostat can be 
adjusted so that the needle of an ammeter can be made to сог- 
respond with a calibration point on the meter dial. This latter is 
visible through a tiny inspection window and the reading is checked 
at least once during the course of testing. Performance on the test 
was scored after 3 minutes, after 6 minutes, and after 10 minutes, 
taking three readings of filter and aperture each time. Further 
details regarding the test are given by Gravely (1950). 


a быш. са. Ши. 
je | 5 Point Biserials 
Gg Dark | холын 157 155141 Ерэн Та, аны | © Ix = Л 
He | Кайо | aog) Hove Ae 
| ee 
Mn SE 22 2 bu Pio | 23 5 sus | ota | 7249 82 
3 | 5 5 15°35 15°27 | 15:18 = E :045 
Уагіапсе 525 1737 | 1804 | 20-68 | 1546 | 
= жок: BEE Ls m й 03 


The Flicker Threshold was measured with 
(Type 1200 C). This instrument isa stroboscopic tachometer and 
is primarily used in engineering for measure ent of sh ds 

rin і aft speeds, 
where the end of the shaft is inaccessible, ica the оросо 


measurement of other moving objects. The St ] 
i 3 rob nsists 
essentially of a neon tube which can be ma oflash co 


de : 1 

at frequencies of from боо to 14,500 flashes р нара 
accuracy of ---т per cent when the instrument is ute, yi 3m 
brated. Calibration of the instrument WHE capri et y Р 4 
each testing session. Testing was carrieq out at the 3 out БС ын 
Vision 'test, which lasted a standard 10-minute end of the Da 1 
adaptation is known to influence results. Comple | 2, the level s 

ing testing are given by Gravely (1950). € details regard- 


a Dawe Stroboflash 


& 
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| | | correla. | Point Biserials 
(50) Flicker | Normats | Атау | нуд | Poete- | 4. ern! ЭШЕ |— 
Threshold | Normals | “States | lysterics | paths | A, & Н. | К Ху A | 87 


No.ofCases | 154 
Mean Score | 2589-0 


à 2469-4 | 2531-1 25043 | 250377 | 
Variance | 59,810 


| | 
36 45 29 | 8: | +197 | 235 | 8r 
20,470 | 68,100 | 10,440 | 47,360 | 


"Tests 51 and 52 are different ways of scoring the Perseveration 
test. The subject is required to write 555 as fast as he can for 15 
seconds, and then S's reversed for 15 seconds. Then he is required 
to write S's and S's reversed alternately for 30 seconds. Score 51 is 
the number of symbols written during the first two periods com- 
bined over the number of symbols written in the third period. 
Score 52 is the total number of symbols written during all three 
periods. 


Point Biserials 


2524 : [ULT READ ET. 
Slien | Уны Gai | Hyserics | Tony 14. el LO 
| | | atio | ен. EvA 
шин | | 5 
No. of Cases | 174 | 40 | 40 | 22 | 80 | -og2 | 
Mean Score 167 | r69 | r74 | r72 г]! | 
Variance 558 | 8 | 8 41 32 | 
! 1 1 | 
1 = T = 5 1 Ї EUN 
Point Biserials 
(52) Speed of | | хань | | | саниа | 
i " Anxiety | jp uus, Psycho- | Correla: 
senting | Normals States | Hysterics paths 4. ен. 
| | | 
5 ies wees - 
o. of Cases | 174 40 | 40 | 8o 
Mean Score | 61-37! 5570 | 5048 = | 8794 
Variance 225:30 | 03:19 | 257:69 | 33004 | 235°25 | 


Score 53 is derived from an experiment on the autokinetic 
phenomenon. The score is the number of seconds before the subject 
reports movement. 


(53) ne | | | Correla- E -Зь анх e 


kinetic м Anxiety | кыш, | БОР | 4, eg H.| “thon 
phonetic | | Normals | “si” | Нуцайн | "Pins | 4 ME x D ГЭЭР 


No. of Cases | 126 37 42 23 79 :095 
Mean Score | 4ф4:| 5746| 4517, 4835| 5092 
Variance 2436-76 [0524750 11988:58 2429:87 [265756 


o 


e 


An examination of these objective behaviour test scores shows 
that quite a large number of them differentiate with considerable 
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efficiency between the various groups. Among the more successful 
are those tests which previous work had indicated as being likely 
measures of neuroticism, i.e. the Manual Dexterity tests, Tapping 
Scatter, Static Ataxia, the Luria test, the Abstractions and Con- 


esting divergence from the usual pattern, namely, the Body Sway 
Suggestibility test, which gives a barely significant differentiation 
between the normals and the hysteric plus anxiety state (r — 159). 
This is lower than the differentiating capacity of the Static Ataxia 
test, which gives a comparable correlation of :255. The lack of 
discriminating ability of the Body Sway test is probably due 


ent sample of 50 subjects, were found to be rather lower than 
anticipated. : 

The next group of tests to be discussed consists of measures of 
expressive movement. These have been described in considerable 
detail in another publication (Eysenck, 1951), and only brief 
descriptions will therefore be given here of each test. These tests 
are of particular interest, in spite of the fact that they universally 
fail to discriminate even at the 5 Per cent leve] between the various 
groups tested, because, as will be shown | 
tests which discriminate at very high levels of 
normals and psychotics. The importance о 
those tests which give the best differentiation 
and normals do not differentiate at all between neurotics an 
normals, will be obvious to the reader; it wil] be referr Ер 2 d 
later in the text. би tO ag 

The first two tests (scores 54-63) are the « ! 1 
"Three Squares’ tests. The subject is handed 9 uat сомы анд 
and is asked to draw, first, three circles, and, second Бо орар 
No further directions are given and no questions a, three squares. 

Linearity. Scoring is as follows: answered, 


The three circles are touching or are © ч 
Oncentric aj i i 
nd linear, i.c. 


< 
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a straight line can be drawn in such a way as to pass tirons all 
three circles. Score = g points. 

The three circles are not touching and are not concentric. 
Score = 2 points. к 

The three circles are not linear. Score = 1 point. 

The three circles are not scorable. Score — o points. 


Point Biserials 


(54) з Circles: дундад Anxiety |р Psycho- | 4 c py] “ш z 
Linearity | States | Уйа рау | AL Ratio |.” HSA. 
| | | 4. “өн. 
| | 
No. of Cases 172 39 52 | 5 76 114 | 


Mean Score | 1:63 1:56 1:46 r56 | 1°51 
Variance :27 "25 26 26 "25 | 


Time taken to draw the three circles. 


m л Point Biserials 
ies | Psycho- | a 
Hysterics Dx А. €& H.| tion | 

garis Ratio 


Anxiety | 


(55) 3 Circles: 
Time States 


Normals 
Nyga | НА: 


No. of Cases | 172 39 37 22 | 76 *о71 
Mean Score | roor | 1092| 9:86 | 11-82 | 10-41 
Variance 63:72 | 103707 | 44°56 | 95:58 | 73:90 


Smallest diameter of the three circles.^ 


m 2 ru Point Biserials 
56) Smallest Anxiet; ü Psych i 
Diameter, | Normals | “шр | Hosteries | им | a. ан.) fon Ку, 


Ж; ен. H. v. A. 


No. of Cases | 172 39 37 23 76 :077 
Mean Score | 815:  3308| 31:62| 2739| 3237 
ariance 33920 | 232-40 | 414700 | 211-10 | 317:00 


Largest diameter of the three circles. 


Point Biserials 


Correla- 
(57) Largest Normals | Anxiety ics | Poco- | 4. e H.| tion 
iameter | Normals | “States | Hysterics | “paths | DN: H.v. A. 


| Ratio A. & H. 


Mean Score | 4r51| 4231| 4r89| 3565| 42°11 
Variance 703:00 | 518-20 | 409:10 | 389:30 | 502-20 


| 
No. of Cases | 172 39 37 23 76 | 071 | 


*-. 


9 
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Average diameter of the three circles. 


Conela. | Point Biserials 
: — 3 Ў : rrela- | 
CRARO | mat | шй | Hoi | Pace | DUE |N Yo y) мул 
—} L 2 | ишени ИНЕ 
No. of Cases | 172 | 39 37 33 | 76 | оў | | 
Mean Score | 36:34| 3718| 36-76 | 31:74 | 36-97 | | 
. Variance | 458-40 | 331:30 | 444-70 | 242730 | 381-40 | | | 


Linearity score for the three squares. Scoring is as follows: 
The three squares are touching or are concentric and linear, 


i.e. a straight line can.be drawn in such a way as to pass through 
all three squares. Score — 3 points. 


The three squares are not touching and are not concentric. 
Score — 2 points. 


‘The three squares are not linear. Score = 


— I point. 
The three squares are not scorable. Score — © points. 
ч А ja. | Point Biserials 
д , | Correla- —— 
GORENA | Normals | Ашу | puis, Due аен) dn [xo T 
Ratio (Xv yy) Hove As 
EE НИИ EE | 
| | el 
No. of Cases | 172 39 | 2 6 ч 6 
Mean Score | 1-70 r62 | ah 138 Uo 95 Ч 1183 
Уагіапсе 24 04 | -25 27 25 
= 1 
Time taken to draw the three squares. 
| йн | б Point Biserials 
6 b | Anxi G ^ |— 
(60) 3 Squares Normals 2216-4913 а ү үз. Ж 
К Квюйсгїї | | Ratio |“ ын, | Hove de 
| == » n 
No. of Cases | 172 39 37 | 2 
| 3 6 *095 
Mean Score | 1848| i597| 12:65 | 135, | 1436 S 
Variance 106-98 158-50 89:51 | 65:99 126-07 


Smallest diameter of the three squares, 


(61) Smallest 


Point Biserials 

Anxiet; " T Correla- 

Diameter | Normals | лг | Hysteries | Pocho- | 4. eg py, "i Pu F 
| Ratio | p | Hv 


No. of Cases | 172 39 

37 23 6 Е 

Mean Score 3733, 36°15| 38-11 29°13 7 TS 

ariance 519'10 | 298-00 | 526-90 | 181-00 
| 
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Largest diameter of the three squares. 


n | |, Point Biserials 
(62) Largest Anxiet; g р MEC 
Boer! | Линар | “Ний | нан | Pate |а ен. ten [уе [куу 
. ia T 14-44 LE LOST 
| PE 
No. of Cases | 172 39 ! 37 °з | 7 ато 
Меап Score 4040| 49:23 | 5000 | 37:39| 49°61 
Variance 838-00 | 723:10 | 961-10 4! 1-10 | 827-80 2 
ded ——ÀÀ Él. ILI ЭН! — си гтэ -- 
Average diameter of the three squares. 
т ггг й 
3) Average Anxiety t 2 ; orrela- | 
(60 Average | Normals | Anxiety Нуман | ул аен. E NEUE. 
E |! аан ies аер ЗОН ЗӨН 
No. of Cases | 172 39 37 | аз 76 чоо | 
Mean Score | 41-63, 43:33| 4324| 3391| 43:2 
Variance 635:90 | 417:50 | 694:70 | 243°10 | 545:00 | 


It will be noted that the linearity score for the three squares has 

a significant F ratio and a corresponding correlation ratio of :195. 

: Similarly, the linearity score for the three circles is almost signific- 
ant. It is interesting to note that these two scores are the only two 
in the battery of expressive movements which are insignificant in 
discriminating between normals and psychotics. 

Size estimation of a pound note (the subject is instructed to 
draw a rectangle the size of a pound noteon a clean piece of paper. 
The score is the length of the diagonal. All measures are in milli- 
metres): r 


| | Point Biserials 


(69) Diameter: Anxiety | "E ? Giga 0-0 
5| Normals | "Se? | Hysterics | “Өр, | A. @ Н.) — tion Н 

'ound note States paths Ratio ОРОГ 5 | B nA 

d | 4 ён. 


No. of Cases | 172 39 37 23 76 2105 | 
Mean Score | 154-53 | 156:67 | 157:57 | 14913 | 157:11 | 
Variance 398-00 | 438-60 | 502:30 | 317740 | 463:50 i | 


Size estimation of a half-crown (the subject is instructed to 
draw the size of a half-crown piece. The score is the length of the 
diameter. All measures are in millimetres): 


Point Biserials 


(65) Diameter: | раду | Anxiety + lll эм Ge |. Se сс... 
Я Я аен.| t " 
Half-crown als | “States. | Posters | “Байг Ratio |y ы Hove: 
No. of Cases | 172 39 | 97 | 23 76 "IIO o 
Mean Score | 35414 | 36:31 | 35:38 33°61 | 35:86 
Variance 36:28 | 48:22 | 22:58 | 21-07 | 32:95 
S.S.P. я p 


146 ТНЕ NEUROTIC DIMENSION: 


| Scores 66 and 67 are taken from the *Fewness of Lines’ test. 
The subject is given a clean sheet of paper with two parallel lines 
drawn on it, five inches apart. He is asked to draw lines across the 


»- 


Sheet between the two lines given, and the time taken (score 66) 
and the number of lines drawn (score 67) constitute his score. 
) Р | Pint Biserials 
(e | Normate | Аш» | ну Pod д ан, Omar ЗГ 
Ratio E H| Hv Ae 
No. of Cases | i171 39 37 23 6 +08. 
Mean Score | 37-68 4403| 3070| 48-70 ы E 
Variance 1266-03 |5730:60 43105 |1120:58 3155-34 
a L эйе 
| | Р Point Biserials 
«Paga | ээний | ДУ | нян | АВ | e| Ои |_ иви 
RS s. i Ж Ratio 222 Hv. A 
No. of Cases | 172 | j 
Mean Score | 100 | 1938 | sage | 1036 12651 22 | 28 | 26, 
Variance | 62-08 3730 | 332 6- SEP) | “вл 
| 4 | 3613 | 36:39 
НЭЭ --------010-1 He 


(68) Average | 


: | | 
Amplitudes | Normals | Apsiety 22 Жан | Pim илийн 
8 Nv. le 

asi & |e 

1 aes | 172 39 6 | 
Mean Score | асч 23 5 Ы | 

à 5708 | 25/54 | 2472 | 23:48 15 5 
Variance 25:96 | 36-78 27:69 HR xul | 
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F 4 | | сын. | _ Pent Biserials 
(69) Average | woran | Алау | нн, | Рио lA S H| ton | op 
Wavelength | Normals | "uates | ene | рей | е l^ эн| Hav 
. 1 | H | | 
No.ofCases | 172 | 39 | 36 | 23 | 75 | 410 | 
Mean Score | 100-17) 98:97| 02:50 97°39 | 95°87 | | 
Variance | 509:30 | 709:40 | 562:10 | 492-900 | 640-80 | | 
| | 


The last five scores are derived from the ‘Measuring Distances’ 
test. The subject is asked to indicate what he considered a 2 ft. 
distance to be by placing two matches 2 ft. apart on the table. 
Next, he is asked to indicate the distance of 1 ft., then one of 8 in. 
The distances are recorded only after he has finished the whole 
test. Scores 70, 71, and 72 are in terms of the distance between the 
two matches in }-їпсһ units. 


Point Biserials 
(70) Distance: | Normals | Anxiety | Hysteries | Pocho- | д. ен. brem 2207 
2 ft. test States paths Ratio 39 12 10 УСА: 
No. of Cases | 159 4t 4n | 23 88 | 055 | . “ёр 
Mean Score | 93:69| 92:20| 9468| 93:70| 95:52 
Variance 170:19 | 264:96 | 226-14 | 205/49 | 24294 X. 
1 | | m Point Виенай 
(71) Distance: Anxiet js | Pocho- | (жш 
Entet | Normals | “др | Hysteries | “ping | As SH. na "HET Т 
No. of Cases | 159 41 47 23 88 7071 
Mean Score | 48:37 | 47:10 | 49°00 | 48:26 | 4811 
Variance 5617 | 5764 | бгог | 60:84 | 6015 
Ў : А | Correla- |. 
(та) Distance: | piat, Ausiely | Hysteries mos A. € H.! бт 
| E und 
No. of Cases | 159 ах. | y] 23215188 400; 
Меап Ѕсоге | 81:33 | 29:03 | 31:38 | 30°96 | 30:70 
Variance | 2344 | 23:27 | 3207 | 2977 | 2819 
! | | ! - 


Scores 73 and 74 are the total overestimate and the total 
underestimate, respectively, i.e. а summation of all positive or 
negative errors respectively in 4inch units for the whole group 
concerned, 
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7 | | | БЭЭ Point Biserials 
(73) Total | ууна | Anviety Нушайа | Perche- | ep pp | Corel зс Е. р 
Overestimate | 577 States ра! Ratio |N; SH | H: v. A. 
| seers 
No. of Cases | 159 Dis da arra 23 98 | сол 
Mean Score 895| 827| їг2 9:61 985, 
Variance 13488 | 167:20 | 308-2 252:52 | 242-06 
| z "m Point Biserials 
Total Anxiety | py и, | Psycho- |, „ i M35 
шеше | Normals States | Муште | "onm | A. © H. ГЭЭЛ | Nv. iva 
| A. & H. 
9, А c еа S 
No. of Cases | 159 4t 47 | 23 98 | -077 
Mean Score 11°58 | 15:05 11:74 | 18:70 | 13:28 
Variance | 261-30 | 328-35 165:76 | 204-68 241:36 
| | 


Another group of tests, namely, the so- 
niques, is represented in this 


‘score is one of normality. 


| Point Biserials 
(75) Rorschach: | |. ie йына. | T dae. 25 
Жашаш | Nomats | Ашу | бин us | er 08 [n Nc. 
M ] | Ratio A. & H. 126 ү 
--4--11| а - 
No. of Cases 157 35 | 2 
42 21 = 77 
Mean Score | 2:92 231 | 2:38 | 1.76 M. 28 ane 023 
Variance 237 | гчо 2:05 2:09 2:05 | 
122:21| os T 
(19) Rorschach: f S = | 1 3 Р] Point Biserials 
Ко ; B TN orrela- 
уоп formals Ed Hysteries | fae |4.€ H. omes HU 
- | lacia 8, | диб d. ern] Be 
No. of Cases 157 35 42 21 | 5 
Mean Score | 6:54 5:83 6:02 544 хи 305 bor 22) 
Variance 466 | 397 17373 | 293 | 3-86 33 
| 1 | 


Of the measures reviewed above, twenty-eight were chosen for 


orial analysis. These twenty-eight scores will 
numbers which they гесеї i 


© 


—:054 
085, 
= 7135 
*099 
"149 
—-069 
+087 


—:016 


153 
116 
—'051 
+140 
-089 
—-261 


—+299 


"PII 
"104 
307 
E294 
— 185 


—122 


1026 
—:023 
— 198 
—:203 
—:166 


—':018 
154 
-306 
169 


—*151 
—*122 


—:019 
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variables; these correlations are given in Table XXV. In view 
of the fact that not every subject had completed every test, it was 
impossible to use the total number of subjects for this analysis. 
Ninety-six subjects had done all except three of our twenty-eight 
tests. Thirty-eight subjects had done all tests, including these three 
(dark-vision, flicker threshold, and the Luria tests). Gonsequently 
the great majority of the correlations in the table are derived 
from N = 96; all correlations involving the above-mentioned three 
tests are derived from N = 38. As pointed out above, there is no 
reason to expect that failure of a subject to carry out one of the 
tests was in any way correlated with that subject’s personality or 
motivation. All the subjects used for this analysis were of course 
normals; no neurotic soldier was included in the correlations. 

We may now proceed to analyse this matrix. Five factors al- 
together were taken out before the residuals could be regarded as 
definitely insignificant. These centroid factors were then subjected 
to criterion analysis. We may regard the ‘Hysteria plus Anxiety’ 
group as our nuclear neurotic group, and in Table ХХУГ, 
column 2, are given the biserial correlations of each of the 28 tests 
with the normal-neurotic dichotomy, defining neurotic in terms of 
the combination of hysterics and anxiety states. In order to make 
all these values positive, the signs of 14 tests were changed, as 
indicated in the column headed *Muitiplier. It is with these 
changed signs that we will be dealing throughout the discussion 
that follows. 2 

Column g gives the saturations of each test with a factor 
obtained by finding that vector which maximized correlation 
between columns 2 and 3. This factor, therefore, is by definition 
identified as a factor of ‘neuroticism’. The correlation between 
columns 2 and 4 is a good deal lower than in several other analyses 
reported in this book; where usually the correlation between factor 
and criterion column is between -6 and °8, it is only -400 in the 
present case; We do not have far to seek for a solution. As pointed 
out before, the normal soldiers were considerably more intelligent 
than the neurotic soldiers through an error in selection. But intel- 
ligence has not hitherto been found to have any close relation with 
neuroticism. It follows that the intelligence test (test 2) has a high 
correlation with the normal-neurotic dichotomy (r = -481), but a 
completely negligible factor saturation (7 = 109). Unfortunately 


1 Bach of the correlations is based on the largest possible number of subjects 
who had completed each test. 
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it is not only the intelligence test which is affected in this way, but 
every test which is correlated with intelligence. Thus the low 
criterion correlation appears to be due largely to an irrelevant 
sampling error. Fortunately, this error does not affect the main 
part of our argument, which is concerned not with the establish- 
ment of the fact that a factor of neuroticism exists, but rather with 
the question of whether the diagnosis of ‘psychopathy’ forms part 
of this factor. 

Accordingly we have. calculated a new criterion column 
(column 8), made up by correlating each test with a new criterion 
consisting of the old neurotic group (hysterics and anxiety states) 
and the psychopaths weighted equally. Column 9, then, is that factor 
which maximizes the correlation with column 8. If this correlation 
is equal to or larger than that found before (r = -400), then it 
would follow that psychopathy forms part of our best possible 
criterion. In actual fact the correlation drops to r = -320, thus 
indicating that psychopathy is not part of the nuclear concept of 
‘neuroticism’, and hence not part of our best possible criterion. 
This conclusion is not influenced by the unfortunate selection error 
which introduced differences in intelligence between normal and 
neurotic groups, as both the hysteric-anxiety and the psychopathic 
groups are equally affected by this error. In the absence of repeti- 
tion and confirmation of this finding it would be unwise to make 
too definite a statement, but the data strongly support the con- 
clusion that psychopathy is likely to have projections on the 
neuroticism axis, but unlikely to lie on it. We cannot, on the 
basis of these data, say whether the proper position for psycho- 
pathy is in the space generated by those axes already located 
(psychoticism, extraversion-introversion), or whether a new dimen- 
sion will be required to accommodate this particular set of person- 
ality traits. In the absence of factual information, it would be idle 
to speculate. 

We may now turn to certain observations suggested by the 
results in Tables XXV and XXVI. We have only attempted to 
interpret the first factor extracted from our matrix, i.e. the ‘neuroti- 
cism’ factor: the question arises as to the nature of the other factors. 
As the experiment was set up with a definite problem in mind, and 
did not therefore include controls which would facilitate such inter- 
pretation of factors other than the first one, it is with considerable 
hesitation that we suggest the possible meaning of factors two and 
three, i.e. columns 4 and 5. The second factor appears to divide 
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fairly clearly the verbal tests from the non-verbal, in a manner 
similar to that usually found in the analysis of batteries of intel- 
ligence tests. On the one side we have the flicker fusion, manual 
dexterity, finger dexterity, body sway, leg persistence, tapping, 
abstractions, and concentration tests; on the other the vocabulary, 
word connection list, Maudsley medical questionnaire, worries, 
likes, interests, and annoyances tests, as well as the Luria verbal 
failures and the Minnesota о scale. There are few exceptions to 
this rule, and those that stand out are perhaps not inexplicable. 
Thus the Luria test appears with the verbal tests, which is perhaps 


intelligible in terms of the exclusively verbal stimulation provided, 
and the verbal response required, 


The emergence of this factor 
hypothesis formulated elsewhere 


(Eysenck, 1950). ‘It is widely 
m hereditary basis, but is also 
ial and other environmental 
rge, that personality tests of 
lated rather more closely to 
onative and affective traits; 
Sgestibility, of autonomic imbalance, of 
5 and of motor expression appear so closely 

al properties of the nervous system E 
sensory equi n that the 
likelihood of hereditary “ен лу, pee 
ests employing unstructured material, 
* questionnaire type, would appear to 
c al aspects of a person's life story and be 
subject to day-to-day fluctuations of mood and outlook. If we may 
use an analogy from physics we might say that tests of this type 
geal with problems of hysteresis rather than with those of struc- 
ture.’ As is well known, in the cognitive field also environmenta 
influences are much stronger in connection with verbal than in 
connection with non-verbal t 


› ests. Further proof fi the heuristic 
hypothesis advanced here will be given in 5 later карш. 


The third factor has reasonab rations; ice 
Grade), as determined Ь Эргэн ea Cig 


з У st of intelligence, con- 
centration, end vocabulary test; it seems likely that шин dealing 
here with an intellectual facto i 
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amplifying this suggestion. Nor will we attempt to interpret the 
other two factors extracted. 

Interesting points are suggested by a comparison of columns 2 
and 3. Apart from the main hypothesis regarding the causes of 
proportionality of these columns, there are certain subsidiary 
assumptions non-fulfilment of which may cause the correlation to 
drop quite considerably. One of these assumptions is that of linear 
regression of each test on neuroticism. Figure 23 will illustrate this 
point. Let the abscissa be the normal-neurotic continuum, divided 
at point Y into two parts, viz. our normal and neurotic groups 
respectively. Let A, B, and C be the regression lines of three tests of 
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NORMAL *NEUROTICISM CONTINUUM NEUROTIC 
Figure 23 

neuroticism. It will be seen that test C will give a good discrimina- 
tion between normals and neurotics, but show very small correla- 
tion with other neuroticism tests in the normal group. Test B, 
conversely, will show good intercorrelations within the normal 
group, but will not give such good differentiation between normals 
and neurotics. Only test A, havinga linear regression line, will show 
strict proportionality between discrimination and intercorrelation. 

The figures in Table XXVI suggest that questionnaires of 
various kinds follow the regression line exemplified by test С; it 
will be seen that for both the Maudsley and Minnesota question- 
naires the figures in column 2 are considerably higher than those 
in column 3. Tests of the Pressey Х-О type and the ‘Annéyances’ 
kind, however, appear to follow the regression line of test B; the 
figures in column 3 are considerably higher than those in column 2. 
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The word connection list, the Concentration tests, the Luria motor 
disturbance, and the Leg Persistence test are examples of tests 
apparently following regression line A. It is difficult to generalize 
from these data, but it does appear reasonable to frame an 
heuristic hypothesis to the effect that verbal tests (both of the 
direct questionnaire апа the indirect cross-out type) have curvi- 
linear regression on neuroticism, while objective behaviour tests 
tend to have linear regression lines. Further research is urgently 
needed to provide more direct evidence on these points, 

We must now face the question of the validity of various 
batteries of tests which might be constructed from those included 
in our battery. The most obvious method of ascertaining this value 
would perhaps be that of calculating а multiple correlation 
between all our tests and the neuroticism factor, which would thus 
constitute our criterion. This procedure is obviously fallacious; 
while it gives us the satisfyingly high R of almost "90, we must 
regard this value with suspicion until the whole battery has been 
validated on another sample of normal and neurotic soldiers. 


method was used in order to minimize 


ments were based on long 
previous analyses. These two sets of tests were called A and B, and 
included respectively tests 4. 


> 5» 14, 16, 19, 39, 42, 47, and o, 11, 15, 
17, 24, 53, 45, 46. It was planned to calculate R for these two sets 
separately and in combination; it was thought that in thus making 


‚ а choice of the tests to be used in advance, those chance factors 
which tend to inflate multiple R would have much less scope to 
assert their influence. 

Subsequent to the com 
dropped from set A becaus 


analysis, test 9 was dropped from Set B becaus 
between this test and the factor was 
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batteries may be criticized, it may be noted that the effect on the 
multiple R's is very small, and on the whole tends to lower rather 
than to raise them. 

The values of R for these two batteries of seven tests, calculated 
according to the Lubin-Summerfield (1951) method, are -755 and 
:728 respectively; for the combined batteries R = -817. These 
values may still be overestimates, but it seems unlikely that the 
true validity of either battery would be much below :70, or that of 
the combined battery below -8о. As a check on this conclusion, 
another method of calculation was tried. Eighteen tests had been 
selected before completion of the calculations, and on the basis of 
previous experimentation, as most likely to correlate significantly 
with the neuroticism factor. Two of these had to be dropped for 
reasons already given (Body Sway suggestibility and Annoyances), 
so that we are left with sixteen tests—45, 14, 15, 16, 17, 19, 24, 4, 
5, 6, 38, 11, 52, 49, 42, 43. A multiple R was then calculated, giving 
equal weights to all these tests, and making no use whatsoever of 
the statistical findings of the present analysis. In this way, both 
selection of tests and weighting system are unaffected by obtained 
results, and the resulting R should give an estimate of the lowest . 
possible validity coefficient for our battery of tests. The value of R 
turned out to be :77, a value which strongly supports our previous 
conclusion. Any reasonable system of weighting based on previous 
work, and not using the figures derived in the present research, 
would raise this figure to -80 or over. It seems reasonable, then, to 
conclude that the battery of tests here assembled has a validity of 
approximately -80 or thereabouts: it will be shown later that a 
similar battery has a reliability of at least :85, and probably above 
‘go. These figures strongly support the argument advanced іп 
Dimensions of Personality that in neuroticism we are dealing with 
a personality factor which can be measured as reliably and as 
validly as intelligence. In further support of this contention, a brief 
description will next be given of two experiments applying similar 
methods to those described to children. One of these experiments 
deals with objective and verbal tests, the other with the Rorschach. 

Hitherto we have been dealing exclusively with adults, because 
most of the work done on emotional stability and neuroticism has 
been done with subjects over eighteen years old. If-our general 
hypothesis of neuroticism as a kind of constitutional weakness be 
correct, however, it would appear essential that experiments with 
children, using analogous tests to those described above, should be 
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shown to give similar results. From the practical point of view, too, 
the measurement of liability to breakdown in children is probably 
far more important than such measurement in the case of adults; 
preventive work is more likely to be successful at a relatively early 
age, and the possibility of ‘screening’ children who need psychiatric 
attention is a very inviting one. Two comparable studies have been 
reported from our laboratory, one using objective behaviour tests 
and pencil and paper tests, the other using projective material. 
In the first of these studies (Himmelweit & Petrie, 1951), 50 
normal and 50 neurotic children were matched on sex, age, and 
Т.О., and tested for five hours by means of individual tests and for 
three hours by means of group tests. Selection of normal children 
was carried out by asking head-teachers to select those children 
for testing who appeared relatively well adjusted, i.e. who showed 


no obvious signs of emotional maladjustment or scholastic back- 
wardness. Selection of neurotic 


choosing consecutive admissions to the Maudsley Child Guidance 


Clinic: These ‘neurotic? children were later rated by the Senior 


Psychiatrist on a five-point scale for degree of adjustment; two 
were rated well-adjusted, and s 


It will be seen that the ne i 


Petrie, 1951); i 
normal children in precisely those 


children was simply on the basis of 
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Fourteen tests showing the best discrimination were selected, 
and a discriminant function analysis performed. The method used 
was that originated by Penrose (1947) which gives a close approxi- 
mation to the linear discriminant function when ай intercorrela- ` 
tions are approximately equal. This method of discrimination 
requires the computation of a ‘size’ and a ‘shape’ score for each 
person. These scores are respectively the sum of the individual 
standard scores of each test, and the sum of individual scores suit- 
ably weighted, the weight being proportional for each test to the 
observed difference of the two group means for that test. The shape 
and size scores tend to be uncorrelated, and the linear function 

_ that best differentiates between the groups (i.e. reduces the number 
of misclassifications to a minimum) is then determined by calcu- 
lating the multiple regression of the normal-neurotic dichotomy 
on the two scores. 

Figure 24 shows the scatter diagram produced by plotting each 
child's score against the ‘size’? and ‘shape’ co-ordinates; each dot 
represents a neurotic child, each cross a normal child, while the 
diagonal line drawn through the cluster of points is the linear 
discriminant. It will be seen that nine maladjusted and ten well 
adjusted children are misclassified according to the criterion, giving 
a total misclassification of 20 per cent. This should be compared 
with a chance level of misclassification of 50 per cent. The quad- 
ratic discriminant was calculated, avoiding the assumption that 
the variance-covariance matrices within each group were the same, 
but did not aid significantly in correct classification. 

The multiple correlation coefficient was found to be + 70, 
a value which presumably capitalizes on chance errors to some 
extent, and is subject to shrinkage in future applications of the tests. 
However, it was found that ‘neuroticism’ scores calculated on this 
basis correlated significantly with the psychiatrist’s ratings for the 
neurotic children, thus showing validity when applied to a sample 
which had not determined the actual weights used. 

The second of these studies is very similar in design and treat- 
ment to the first one (Cox, 1951). Subjects were divided into an 
experimental group of sixty neurotic boys attending the Maudsley 
Child Guidance Clinic, twenty of whom were aged from 8 years to 
9 years 11 months, twenty from 10 years to 11 years 4I months, 
and twenty from 12 years to 13 years її months, and a control 
group of ‘normal’ boys matched for age and intelligence. An indi- 
vidual Rorschach test was given to these children, as well as an 
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individual intelligence test of the non-verbal type (Progressive 
Matrices). In scoring the Rorschach test, Beck’s (1944) scoring 
categories were used, with minor additions from Klopfer and Kelly 
. (1942). S 
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matrix, which is reported in full elsewhere (Cox, 1951). The 
saturations for these factors of the twenty-seven items—twenty-five 
scoring categories, I.O ., and criterion—are given in Table XXVII. 
Also given are the correlations between the various items and the 
normal-neurotic criterion (cf. column ‘D’.) The first value in the 


TABLE XXVII 


I I ш | iw v D 
| 
1. Criterion 265 | 734| 208 | —196| —121| (83) 
2. А 484 171 441 220 139 12 
9. Ad. 360 503 | —305 | —184 226 43 
4. Hds. 403 562 | —182 127 218 49 
5. Fire 392 | —378 189 089 | —178 | —o8 
6. Geol. & mtns, 474 | —188 342 397 084 03 
7. Water 441 | —505 174. 056 | —037 | —10 
8. Arch. 477 072 598 | — 110 | —084 23 
g. Mech, Sci, 362 398! 417 084 | —273 50 
10. Mise, 140 | —527 252 075 267 | —47 
11, Ws. 188 154 478 142 | —332 25 
12. D. 825 153 | —183 | —216 199 | 48 
13. Dd. 602 257 | —278 257 | -050 25 
14. I's, 942 154 | —268 095 | —o89 34 
15. F+’s 841 | 261 | —094 058 | —305 40 
16. F—'s 661 | —o69 | —264 | 337 382 | —03 
17. F+ 731 | 435| —289 | 142] —157| 40 
18. F— 501 193 | —380 197 508| 18 
19. ЕМ 426 | —128 | —129 047 105 00 
20. CF— 199 | —477 | 321| —149 | 335 | —33 
21. Ci ^ 484 | —560 056 319 | —050 | —38 
22. V & Y 575 | —475 276 | —363 169 оо 
23. LO. 030 254 512 257 | —084 32 
24. No. 897 | —063 | —354 149 | —177 21 
25. Av. time —448 461 167 250 365 | —o8 
26. Range -451 420 | 161 337 489 | —03 
27. Failure —811 | —297 | —036 516 | —246 58 


Decimal point properly preceding each entry omitted. 


*D* column, which has been put in brackets, is the square root of 
the communality for the criterion, and is an estimate of the degree 
to which the total battery has succeeded in measuring the trait 
underlying the dichotomy, i.e. neuroticism. 

The interpretation of these factors is relatively straightforward. 
The first factor appears to be one of productivity or^number of 
responses, having its highest loadings on ‘total number of responses’ 
(897) and total number of responses involving the use of form (-942), 
and its highest negative loading on ‘failure to respond’ (—-811), 
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The second factor appears to be the neuroticism factor ong 
would expect to find very prominent in such a sample of children; 
it has its highest loading in the normal-neurotic criterion (789) 
and is distinctly proportional in its loadings to the D column. T е 
third factor appears as one of intelligence, with high loadings 


on LO. and cultured and unusual responses like architecture, 
. mechanical and scientific, and geology. 


'The fourth factor depends for its interpretation so much on 
Rorschach mythology that only the most tentative explanation 
seems in order. It appears to contrast the individual who meets 
the environment either with a 
violently, by emotional outbursts and point-blank refusal to co- 
operate, with the individual who experiences the environmta 
without attempting to control it, being too absorbed in his persona 
difficulties. ‘This interpretation is based on the implications 10 
Rorschach terms of the Geology and Mountain response, which 18 
suggestive of superficial eva: 
and the pure Colour responses indicating crude emotionality. These 
characteristics may be conceived as 
inferiority, anxiety, depression, 
passivity, implied by the chiaroscura and vista responses. Т he 
similarity of this ‘dynamic’ interp 
dichotomy is obvious, 


Having thus shown the presence ofa strong factor ofneuroticis™ 
in the test data, a discri 
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in the factor analysis, i.e. in a type of analysis in which no such 

maximization of error takes place, suggests that our figures err if 

anything in underestimating the possible usefulness of this test for 

the detection of potential neurotics. It should also be borne in 
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mind, of course, that the criterion itself was far from perfect and 
that higher validity coefficients could be expected if some way 
could be found to purify the criterion. / е xS 2 
This analysis raises important theoretical points. Projective 
experts and Rorschach analysts will undoubtedly protest that by 
8.8.Р, м 
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thus treating the Rorschach as an ordinary psychometric device; 
* violence is done to the underlying principles of this technique. 
They will point out that when a record is being analysed, the 
meaning of a given score changes subtly in accordance with the 
` context in which this score is found, so that we must never treat à - 
given response ‘analytically’, i.e. in isolation, but must see it as 
part of a gestalt. Failure to do so robs the Rorschach test of its 


most distinctive features, and falsifies the picture which may be 
obtained from it. 


The atomist's reply would 
would point out that it is 


ided in the experiment.) ү 

Below are given the results for expert M (Tabl XXVIIA). It 
noted that out of 116 children, 58-6 per КЕҢ are correctly 

ed, a result Which is significant at the 5 per ае, when 
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the one-tail test is used (¢ = 1:764). ‘Definitely’ neurotic or normal 
identification is no better than ‘probably’, neurotic or normal .. 
identification. The correlation between prediction and actual status 

is -17. These results are hardly encouraging. 


TABLE XXVIIA 


] П 
| Actual Classification 


Classification 
by Expert M т 
Neurotic | Normal | Correct 
5 | ! 
| % 

Definitely neurotic 15 | бо a 
Probably neurotic 19 14 58 
Probably normal 15 | 26 63 


Definitely normal | 9 | 8 47 


Results from the other Rorschach expert show that of 96 chil- 
dren, 23 were correctly classified as normal and 23 as neurotic: 
25 normal children were misclassified as neurotic, and 25 reurotic 
children disclassified as normal. The result, it appears, is slightly 
though not significantly worse than might be expected. by chance. 
(It'should be noted that when this expert sorted protocols without 
having to pay attention to the condition that 50 per cent of the 
children were normal and 50 per cent neurotic, he obtained very 
much better results, chi squared being 10-7. It is difficult to account 
for this difference.) 

No finality is claimed for this comparison between the atomistic 
and the gestaltist methods of approach, nor do we claim that the 
overwhelming superiority of the atomistic method would neces- 
sarily be found in future work. This particular comparison was 
introduced as an afterthought, and did not form part of the original 
design of the experiment; consequently certain controls are lacking 
which would be essential if the results were to be regarded as in 
any sense definitive. It is included here mainly for the purpose of 
pointing out that theoretical points of view, such as those of the 
atomist and the gestaltist, can be settled on the empirical level, 
and are capable of being submitted to controlled experimentation; 
twenty years of verbal argument have not produced the consensus 
of opinion in this field which ought to follow a series of well- 
designed studies. It is, therefore, the method rather than the result 
of this research on which we wish to lay stress: Those who believe 
in the gestaltist approach, and want to remain in the realm of 
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Science, must sooner or later submit their hypotheses to a strict 
experimental test of the kind here outlined; semantic argument 
along well-worn lines is not likely to make their point of view 
acceptable to experimentalists who look rather for empirical 
demonstrations. 

While these results and considerations make it appear feasible 
to consider techniques such as the Rorschach in terms of orthodox 
test construction, application, and validation, it should be stressed 
that_in many respects these techniques fall behind legitimate 


namely that of reliability. The literature here is in such a confused 


state that it is almost impossible to derive any agreed conclusions. 
Thus in one breath Rorschach 


ә : ubjects, in an 
attempt to obtain evidence on the reliability of Ше Rorschach 
Scoring categories (Meadows, 19 re would сет 
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are significantly different, then it is impossible to maintain the 
hypothesis of comparability. Table XXVIII gives means and 
sigmas for eight scoring categories (R, W, D, M, F, C, A, and P), 
for the normal, abnormal, and total groups respectively. Com- 
parison shows the close similarity of the figures for the two tests. 
In addition to the 8 scoring categories quoted, another 27 were 
in actual fact compared, for normals and abnormals separately, 


TABLE XXVIII 
MEAN RORSCHACH SCORES AND SIGMAS, AS COMPARED WITH MEAN 


BEHN-RORSCHACH SCORES AND SIGMAS, FOR NORMAL, ABNORMAL, AND 
TOTAL GROUPS 


Scoring Category | XR XB | oR oB 
N 24°40 25:62 10-17 11:25 
к {a 19:00 19:83 10:26 10:16 
T 21-80 22:76 11:59 11:10 
М 8:02 | 8:24 3:36 3:58 
W XA 6:31 5:60 3:24 2:74 
T 720 | 694 341 3:46 
15:05 1606 (|^ 10:07 9:26 
D 4A 11:50 13:10 8:65 8:47 
T 13:40 1455 |? gôr g'or 
N 2:08 1-82 1:44 1:68 
M 4A 99 1:12 1-21 її 
т 1:55 1:47 144 1:53 
N 11:24 10:04 8-28 6 
F ЗА 8-25 9:15 631 Dua 
ay 9:80 10-01 758 6:3 
хэв —_—_|—— 
N 1:56 1:2 144 1:31 
G {А бо -б3 1:02 1:07 
18 1:09 93 1-34 r2 
N 7-89 10:08 415 4:68 
A {А 6-75 8-22 3:94 434 
T 733 915 4°09 4761 
N 448 423 1:62 шор 
Р {А 3:38 3:37 67 97 
T 3:99 3:80 1:65 166 
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making a total of 70 comparisons. Out of these 70 comparisons of 
values for the means, three were significantly different at or be- 
yond the 5 per cent level. This is roughly what would be expected 
_ by chance (it is impossible to give an exact value to the chance 
hypothesis as the scoring categories are of course not independent); 


consequently there is little reason to assume that the two tests are 
not comparable. 


TABLE XXIX 


RELIABILITY COEFFICIENTS (RORSCHACH v. BEHN-RORSCHACH) FOR 
NORMAL AND ABNORMAL GROUPS 


Category AM “А А: | Category N: 4: 
b | pp Вэ өс 

R (£) -83 86 || FM (Е) -36 :54 
юм (| 8 mt [Ап (0) 35 51 
на биш с) 8 41, |G (£) 34 '52 
р # f) 74 81 | ш (f) `33 52 
Ad c) Wi 54 | ААГ (с) “31 62 
5 -(0| 69 40 416 (f) 30 58 
А (с) *63 :Во | ЕК (£) ‘30 35 
ЕС (£) :62 ?5 ра (f) 28 :ба 
н (0| 57 57 A Obj. (с) 26 ‘57 
уу (£) 53 -бо | Pl. (c) -26 43 
M (f) 152 -бо Blood (с) оо -62 
Ес (f) `50 „`45 ра (Е) оо +59 
Obj (c) “50 E Sm. СІ. (с) 18 24 
K (f) 47 `58 K (£) "4 "33 
Боов © 45 “52 c f) “з —10 
c "40 40 СЕ (| -о -39 

Sx.At. (c) 39 “65 Fire с - р 
Р (f) 3B a | (c) 05 40 


f — formal score 
€ — content score 
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further category (Е) gives a correlation of over -70 for both normal 
| and abnormal groups, and one category (А) gives a correlation of 
-over :бо. Five correlations (Ad., Н, УУ, M, and Obj.) are over -50 
| for both normals and abnormals. A considerable number of cor- 
relations is in the low twenties or below, and some are actually 
negative! These results should be borne in mind by those who base 
their interpretations on minute differences in Rorschach scores; 
the possibility that these differences are merely fortuitous, and of 
no fundamental importance, cannot be disregarded. Б 
Indeed, і we were to advance an hypothesis to account for the 
superiority in the experiment described on page 163 of the statistical 
over the intuitive approach, it would be in terms of the tendency 5 
of the expert to pay attention to, and base his interpretation оп, 
minute idiosyricrasies, odd phrasings, and slight deviations from 
{һе normal which have little reliability, and consequently little 
ii validity, while giving less weight to the more reliable and valid ^ 
scoring categories which are picked out by the factorial—statistical 
approach. The same hypothesis has been advanced by Kelly and 
Fiske (1950) to account for a similar failing of the projective- 
intuitive approach in the large-scale Michigan experiment into 
the selection of Clinical Psychology students. “The essence of 
clinical evaluation and integration of data involve permitting the 
clinician to assign to each item of opinion “‘beta weights”, which 
vary from case to case according to the clinician’s perceived pat- 
terning of the data. Our findings suggest that this technique may 
result in increasing the ratio of error variance to true variance 
with successive ratings based on increments of information. This 
may lead to a subjective feeling of increased knowledge about the 
assessee without a parallel awareness of the fact that many of the 
additional items of information are not actually correlated with 
the criteria, and hence should not be weighted in arriving at a 
prediction about the assessee.’ \ 

- This chapter has shown.us that at least one dimension in {һе 
non-cognitive field can be measured with sufficient accuracy to 
make it possible to proceed to more formal experiments on the 
causation and the correlates of ‘neuroticism’; we must now turn to 

` a detailed consideration of the available evidence in these fields. 


a 
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Chapter Five 


HEREDITY AND ENVIRONMENT 


ў 


in the determination of an individual's personality. If we accept 

the well-known definition of personality as ‘the integrated organ- _ : 
ization of all the cognitive, affective, conative, and physical char- 
acteristics of an individual as it manifests itself in focal distinctness 
to others’, we might expect that much research endeavour would 
have been dedicated to the discovery of hereditary influences on 


the cognitive, affective, conative and physical characteristics of 
the individual. A certain 


but its emphasis has been 
into inheritance of physic 


tance of cognitive abilities, but there has been little worth-while 
research into the conative a: 


Г is commonly believed that heredity plays a considerable part · 


twins with that of fraternal twins. The difference 
twins, due to environment 


between fraternal twins, due to both heredity a 
differences between fraternal twins 

identical twins, heredity app 
while if differences are small or non-existent, the influence of 
heredity as a causative factor in individual di 

It is possible to give mathematical ex 
contribution of heredity and 


The large amount of research done along these lines into the 
inheritance of intelligence has been summarized adequately by 
Verschuer (1939), Schwesinger (1933), Newman, Freeman, and 


Holzinger (1937), Gottschaldt (1939), and Woodworth (1941). 
168 
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The fairly universal conclusion has been that ‘inter-family environ- 
mental differences account for a much smaller proportion of the 
variance in intelligence than do hereditary differences’ (Shuttle- 
worth, 1935). No equally comprehensive generalization has 
hitherto been made in the affective and conative fields, if we except 
the rather negative conclusions arrived at by Newman, Freeman, 
and Holzinger (1937) who say: ‘In most of the traits measured the 
identical twins are much more alike than the fraternal twins, as 
indicated by higher correlations. This is true of physical dimen- 
sions, of intelligence, and of educational achievement. The only 
group of traits in which identical twins are not much more alike consists of 
those commonly classed under the heading of personality. . . . The differ- 
ence in resemblance of the two classes of twins is not the same in 
the different groups of traits. In general, the contrast is greater in 
physical traits, next in tests of general ability (intelligence), less in 
achievement tests, and least in tests of personality or temperament. 
In certain instances, viz. tapping, will-temperament, and neurotic 
disposition, the correlations of identical twins are but little higher 
than those of fraternal twins.’ 

A brief review of such experimental and observational data as 
are available will indicate some of the reasons for this failure to 
achieve positive results comparable to those achieved in the cogni- 
tive field, and will also make us familiar with certain dangers in 
twin research which have invalidated many conclusions confidently 
drawn from methodologically inadequate data. 

Much the most extensive work on the inheritance of person- 
ality has been done in the field of mental illness, where twin studies 
of psychotic (and occasionally neurotic) patients have been widely 
accepted as a method for elucidating the influence of heredity on 
pathology. The work of Rudin (1930), Rosanof e£ al. (1935, 1941), 
Essen-Moeller (1941), Luxenberger (1928, 1930, 1933, 1934» 1935, 
1940), Kallmann (1941, 1946), and Kallmann & Barrera (1942); 
leaves little doubt that concordance in identical twins is consider- 
ably more frequent than in fraternal twins, although still far from - 
perfect. This type of work inevitably suffers from the subjectivity 
of psychiatric diagnoses, and the lack of reliability associated with 
all rating methods. 

À quite different field, which also has attracted some attention, 
is that of criminal tendency. Lange (1931), Stumpfl (1936), Kranz 
(1936), and Borgstroem (1939), have shown that identical twins 


1 Italics not in original. 
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are more concordant in their criminality than are fraternal twins, 
even when complete separation has taken place. Popenoe (1936) 
concludes from a review of this material that *we must ascribe to 
heredity a more important role in the production of crime than has 


hitherto been the case’. 


~“ When we come to the more definitely experimental type of 
study, we find that the relevance of the experiment to the question 
of heredity of personality is usually contingent on the theory of 
personality held by the investigator. Conditioning experiments 


appear important to the followers of Pavlov; eidetic imagery to ~ 


those who agree with Jaensch's system; handwriting is studied by 


~ the ‘expressive movement’ school; autonomic patterns are investi- 


& 


gated by the physiological experimenters; perseveration and 
fluency by the Spearman school; projective techniques are- 
employed by the followers of Rorschach and Freud. 

Much work has been carried out in the field of handwriting, 
both along subjective, interpretative lines and along objective, 
quantitative lines. The picture is somewhat confused. Bracken 
(19404 and b) found no hereditary determination for differences in 
handwriting pressure in 42 pairs of twins, while Carmena (1935) 
and Miguel (1935) came to the opposite conclusion after experi- 
menting on 50 pairs of twins each. Writing speed was found to be 
determined largely by heredity in Bracken’s (19394) study of 38 
sets of twins, a conclusion confirmed in a later study by the same 
author (Bracken, 19404) on 42 pairs of twins, Thelen (1939) con- 
ducted matching experiments which showed hereditary factors to 
be prominent in handwriting, while Hartge (1936) found hand- 
writing to be of no diagnostic value in individual diagnosis of 


and Hermann (1939) agree in 
nation in handwriting, with thc 


ceptio i - Saudek & Seeman (тозо, 1 33) 
emphasize both heredity and environment in the LE E e 


п, Freeman, & Holzinger ( 1937) 


finding little hereditary determi 


quality of the handwriting 
h (1998) ). 

8 studies are three investigations 
nt test, which is based essentially 


c | ` Тагсзау (1939) reports negative 
conclusions, while Bakwin (1930) is more positive, er Free- 
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man, & Holzinger (1937) report findings which definitely disprove 
the hypothesis that heredity determines individual differences in 
reaction to this test; the intraclass correlations on the four scales of 
this test used by them are higher for fraternal than for identical 
twins! (The actual values are -69, -36, :53, and +51 as against 45, - 
'31, “51, and -48.) at 

Perceptual factors in the personality of twins have been studiéd 
by Bracken (1939с), Hofstetter (1948) and Smith (1949). Eidetic 
imagery, which according to Jaensch’s theories is closely connected 
with personality type, was shown by the last-named author to be 
strongly determined by hereditary factors, as was also reaction _» 
time. The other authors show that in the production of after- 
images, in the extent of visual illusions, and in accommodative 
convergence heredity plays a powerful part. 

A beginning has been made in the study of conditioning in ~ 
twins by Kanaev (1934, 1938, 1941), who used the Krasnogorsky 
modification of Pavlov's method to show remarkable similarities 
between identical twins. He links this approach to Pavlov's theories 
of personality, which are based on conditioning experiments, and 
itis unfortunate that his suggestive work has been left in a relatively 
undeveloped state where no certain conclusions can be drawn. 

Another physiological variable which has been studied on the 
hypothesis that it might be found to be correlated with personality 
differences is the brain-wave pattern. Lennox её al. (1945), Elm- 
gren (1941), Davis'& Davis (1936), and Gottlober (1938) have 
shown remarkable similarities between identical twins, and less 
marked similarity between fraternal twins; in the absence of any 
satisfactory theory linking brainwave patterns to personality, how- 
ever, it is difficult to interpret these findings. 

Of more definite relevance to personality are three studies of а 
yet a third physiological variable, viz. the psycho-galvanic reflex. 
The important monograph by Wenger (1948) has definitely estab- 
lished the close connection between neuroticism and autonomic 
imbalance, and has shown that the P.G.R. is a good measure of 


‘autonomic imbalance. Carmena (1934, 19354), working on бо 


pairs of twins, showed that the P.G.R. is strongly influenced by 
heredity. Jost & Sontag (1944) supported this finding by using, in 
addition to the P.G.R., pulse pressure, salivation, héart period, 
respiration rate, vasomotor persistence time, and other autonomic - 
measures. They conclude that an autonomic constitution may be 
at least partially inherited. 
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Ratings along the lines of the Vineland Social Maturity Scale 
have been carried out by Bracken (1939), Troup & Lester (1942), 
and Wilson (1941); there is considerable agreement that whatever 
is measured by this scale is influenced to a considerable extent 
by heredity. While it is believed that the Social Maturity Scale 
measures personality factors additional to intelligence, it has not 
hitherto been possible to identify them, and therefore it is difficult 
to interpret the findings. 

Motor skill has been studied by Brody (1937) and by McNemar 
(1933). Both authors find identical twins much more alike than 
fraternal twins, and conclude that the hereditary hypothesis is the 
most plausible explanation of individual differences in motor skills. 
This finding is relevant to personality research because it has been 
shown by Eysenck (1947) that motor skills correlate quite highly 
with neuroticism, so that hereditary determination of individual 
differences in skill give presufnptive evidence for hereditary deter- 
mination of individual differences in neuroticism. 

Related to this work 
and Pauli (1941) showin 
pronounced in fr; 


type heredity played an im- 
ts of twins, found that b Bp 
no : uence. Tests of F (fluency О 

C rd Were also given by Cattell & Molteno, vun 
that family- es 'are about 8 times as import- 
€s In accounting for individual 


sion links up with the work of 
Carter (1938, 1939) and Sorensen & Carter Gain on association. 
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sufficiency and dominance. This conclusion does not agree with 
the results published by Newman, Freeman & Holzinger (1937) 
with respect to another neuroticism questionnaire. The Rorschach 
technique has been used by Troup (1938), Eckle & Ostermeyer 
(1939), Marinescu ед al. (1934), and Kerr (1936). Results are con- 
flicting, Kerr's being essentially negative, Marinescu'$ positive, 
while the other reports are somewhat intermediate. No clear pic- 
ture emerges from the combined results, however. 

Certain techniques have been used by one investigator only, 
and while the connection between the test used and personality is 
not always clear, the results illustrate a point in methodology which 
we want to stress in the discussion. Szondi (1939) found greater 
similarity in identical twins when he applied the test that bears 
his name to 97 pairs of twins. Petó (1946) used psychoanalysis as 
his method of investigation, finding surprising identity of symptom 
in two identical twins. Malan (1940) found spatial orientation to 
be an inherited trait. Hunt & Clarke (1937) showed marked differ- 
ences in the startle pattern of a pair of twins. Carter (1932) found 
occupational interests to be due in part at least to hereditary 
causes. Frischeisen-Kóhler (1933) reports that personal tempo is 
definitely conditioned by heredity. Thompson (1943) showed deter- 
mination of play-behaviour Бу, heredity. Waardenburg (1929) 
showed greater similarity between identical twins with respect to 
likes and dislikes. Zilian (1938) found less variability for identical 
twins on imaginal and motor factors. Steif (1939) found great 
similarity in scribbling between identical, little similarity between 
fraternal twins, a result similar to Luchsinger's findings with re- 
spect to voice range (1940). 

Certain obvious characteristics emerge even from this very brief 
review of the literature dealing with twin studies in personality 
research. (1) Objectively oriented investigations are mostly very 
limited in scope, dealing with traits of a low order of complexity 
such as scribbling, angle of handwriting, eidetic imagery, reaction 
times, or spatial orientation. (2) When an attempt is made to study 
higher-order concepts, such as criminality or psychosis, the con- 
cepts chosen are of a sociological-ethical, or psychiatric rather 
than of a psychological nature, and the investigation proceeds 
along lines far removed from the objectivity of psychometric test- 
ing procedures. (3) While on the whole most authors agree that 
identical twins are more alike than fraternal twins on most of the 
tests used so far, there are many inexplicable contradictions (e.g. 
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the studies of perseveration, of questionnaire tests, and of the 
Rorschach). (4) Most personality tests have such low reliabilities 
that results are almost bound to be disappointing; correction for 
attenuation is seldom attempted. (5) Even when results are clear- 
cut, they are often difficult to interpret due to our lack of knowledge 
of precisely what it is a given test is measuring. 

Some of the conflicting results reported may be due to technical 
faults which vitiate many of the studies reported. Results are often 
stated as impressions, rather than being reported as objectively 
scored and statistically validated conclusions. Choice of twin-pairs 
to be tested is often based on faulty sampling practices, fraternal 
twins which look unlike each other being overlooked in favour of 
those who resemble each other. (Correct procedures are suggested 
by Verschuer (1939) and Rosanoff et al. (1937).) Diagnosis of 
monozygoticity or dizygoticity has often been faulty, even where 
the procedures adopted have been described in full. These and 
other technical faults are easily overcome by experimenters of 
reasonable competence. Two other criticisms are more funda- 
mental, and must be discussed in some detail. 

(1) In passing, we have noted that the whole procedure of twin 
research rests on the assumption that the environment is as similar 
фт ne of fraternal twins as it is for a pair of identical twins- 
p ED ЫТ" (1930), Bracken (1933, 19344, 19340, 1935» 
cm (1934), and Jones & Wilson (1933) present reports 
вла identicals are treated more alike than are fraternals, 
oer ie M rper to invalidate this assumption. Bluc- 
ur on "D , s meyer (1935), and Misbach & Stromberg 
MAN api эн kand, show that often the very fact of the 
that they take Men Чэн {о different development їп the sense 
man et al. (1937) ha тар ementary roles. Schiller (1937) and New- 
Bounterwek (8 6) d hee this point. Meumann (1935) and 
select different occupations. Wosdenren d ыж. es 
n E ba : orth (1941) comments tha 
such differentiation of roles as has been observed would probably 

1 d ain special abiliti lit 
пы а апу сазе, аз Wilson (1934) has du eir A 
в у ов the identical pairs live under тоге similar conditions 

an the fraternals, *this fact must be attributed ultimately to the 
> 


influenge of their heredity which led, or forced them to “select’ 


is impossible to be dogmatic 
argument against similarity 
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of environment for the two types of twins is speculative and hypo- 
thetical; there is no evidence to suggest that such differences as 
may exist are not themselves due to hereditarily determined selec- 
tion of environments, or that the small differences observed could 
account for the large differences in test results. Until more em- 
pirical evidence is produced by the critics we cannot concede that 
their arguments do much damage to twin research methodology. 

(2) The following criticism of twin work has not to our know- 
ledge been made previously, although a solution to the problem 
posed by it has been published elsewhere (Eysenck, 1950). It is 
relevant not only to studies of temperamental traits, but equally so 
to work on intelligence and would seem to undermine the elaborate 
structure of argument built up on twin research. Essentially, this 
criticism concerns a conceptual jump which takes place when an 
argument is presented regarding the inheritance of intelligence from 
the intercorrelations of identical and fraternal twins on a particular 
test, say the Binet. We may generalize and say that a demonstration 
that individual differences in performance on a given test are due 
to heredity cannot be used as proof that a hypothetical trait or 
ability imperfectly measured by that test is inherited. The argu- 
ment can best be presented by using an algebraic model. 

We may write the factorial equation of the Binet test in the 
following form: : 


O BINET = o5? + ay? + a? + Ogp? + Oy + 
Oot... ox? + ag? Tog 


where c?BINET denotes the total variance of the Binet test, og? the 
contribution to that total variance made by ‘g or intelligence, 
whileay?, o y?, ogp?, бам, Co? . . ay? denote contributions to the vari- 
ance by verbal, numerical, spatial, memory, comprehension, and 
other group factors, and os? and ср? stand for the contribution to 
the variance of specific and error factors. Using estimates derived 
from Burt & John's (1942) analysis of the Binet, the total variance 
contributed by ‘g’ or intelligence is only about 30 to 40 per cent, 
or less than half the non-error variance. McNemar's (1942) series 
of analyses attributes on the average 40 per cent of the total vari- 
ance to ‘g’, the proportion ranging from 35 to 50 per cent. If we 
neglect the error variance, which amounts only to about 5 per cent 
and of course cannot be said to be caused either by heredity or 
environment, but which is merely an error of measurement uncor- 
related with the abilities or traits the test is measuring, we can 
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conclude by and large that one-half of the variance at most is due 
to ‘g’, while at least one-half is due to various other common factors 
or specifics. 

It will be clear now how unjustified is the jump from the state- 
ment ‘individual differences in Binet scores are accountable for in 
terms of heredity to the extent of 80 per.cent, which, granted 
certain assumptions, is a statement of fact, to the much more usual 
statement that ‘individual differences in intelligence are account- 
able for in terms of heredity to the extent of 80 per cent’, which is 
a completely unwarranted generalization which could be made 
only if all the non-error variance were attributable to ‘g’. It could 
be that all the group and specific factors hypothesized were com- 


pletely determined by heredity, in which case heredity would play. 


only a very minor part in the determination of ‘g’; it could be that 
the group and specific factors were largely caused by environ- 
_ mental differences, in which cese ‘g’ might be тоо per cent inherited. 
Unless we can analyse the total variance of a test into its constituent parts, 
and measure these parts separately, no scientifically tenable conclusion can be 
drawn from the data. If this criticism be justified, it follows that the 
whole literature on the inheritance of intelligence, perseveration, 
social maturity, motor skill, conditionability, personality type, 0 
any other ability or trait, in so far as it is based on twin studies, 
ШШ! be considered invalid. This conclusion may appear harsh in 
Men but s is difficult to see how it can be avoided on the 
QUIA ар p what has been said above that if we want ue 
in a gi gree to which a particular trait or ability is inherite 
given sample, then we must study, not the individual test 


ae эв the hypothetical underlying factors which 
шэнэ ed 1 roc In studying intelligence; that wou 
ТЭЙ е ps a uo of tests to the experimental popula- 
A ae ing these tests, factor-analysing the resulting 
atrix, and obtaining factor scores for each experimenta 


subject on each factor isolated. We could then submit these facto” 
Scores to the mathematical treatment appropriate т л roblen 
and obtain data relevant not to one test only, but to intelligen ps 
or verbal ability, or memory, or whatever our fa ight tur? 
out to be. ее. 
р" the field of cognition, the main factors underlying test per 
ance have been isolated by Spearman, Thurstone, Holzing& 
and other experimenters using the method of Топ analysis: 
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We know now what to measure, and we know how to measure it. 
In the fields of conation and affection, however, the position is less 
clear. There are numerous theories, but few facts; much is hypothe- 
sized, but little known. The problem thus arises as to the correct 
choice of the experimental variable. 

Our choice has been determined largely by two considerations: 
(1) The social importance of the trait investigated, and (2) the 
existence of a sufficient body of knowledge regarding it. On both 
counts we had little difficulty in arriving at our decision to study 
the inheritance of the trait variously named neuroticism, emotional 


instability, or lack of integration, and discussed in detail in earlier _ 


chapters. We may state the hypothesis investigated in this paper in 
a formal manner: The trait of neuroticism, as operationally defined 
in terms of the pattern of intercorrelations between a specified set 
of objective personality tests, is in large part determined by 
heredity, and such individual differences with respect to it as 
appear in the experiment cannot be accounted for in terms of 
environmental influences. . 

The relevance of the experiment to the psychological and psy- 
chiatric fields needs little stressing; it may be worth while, how- 
ever, to point out the importance attaching to the result from the 
point of view of the logic of factorial analysis. It will have been 
noted that our definition of neuroticism 48 essentially in terms of 
factorial analysis, i.e. in terms of the condensation of a set of 
observed correlations into a smaller number of hypothetical under- 
lying variables or factors. This method has frequently been criti- 
cized on various grounds, and while some of these criticisms have 
not always been based on thorough knowledge of precisely what is 
implied in the method, it cannot be denied that some doubts could 
not be allayed in terms of statistical arguments alone. In particular, 
the fact that the resolution of a given matrix of intercorrelations 
into factors can be carried out in an infinite number of ways has 
perplexed many critics otherwise not hostile to this approach. 
Thurstone’s method of overcoming this difficulty (a difficulty which 
is not faced at all by some writers like Burt and Stephenson) is 
well known; it consists essentially in overdetermining the solution. 
This method, while of the utmost importance in work on abilities, 
appears less well suited to the requirements of non-cognitive experi- 
mentation, and the method of ‘criterion analysis’ has been suggested 
as a plausible alternative (Eysenck, 1950). 

However, proof lias been lacking hitherto that factors thus 
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determined have апу 'real' existence, and are something other 
than mere ‘statistical artefacts’. In part the argument about real 
existence is of course a philosophical and semantic one; in a vem 
definite sense any scientific concept is an ‘artefact’ lacking. real 
existence. A concept, whether it be that of an electron, an instinct, 
a quantum of energy, a complex, or a sound wave, is an abstrac- 
tion, and thus not ‘real’; scientific concepts are ‘artefacts’ almost 
by definition, However, what critics usually mean by this objects 
is something quite different. They denote a factor an ‘artefact 1 
it does no more than merely summarize existing knowledge, if it 
“does not go beyond the circle of its own derivation. 7 
__ Неге the present experiment should be of crucial importance. 
Having defined our factor of neuroticism in terms of the intercor- 
relations of a set of tests, we proceed to examine the biological unit 


of this factor by analysing the degree to which the factor as a whole 
is inherited. The crucial 
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attract attention and are brought to the investigator's notice, while 
those differing considerably in appearance and behaviour may be 
overlooked. The questionnaire method of sampling is even more 
likely to yield a sample overloaded with fraternal twins who are 
very much alike. Probably the most adequate method of securing 
an unbiased sample is to use. the birth record method (von 
Verschuer, 1939). This was the method used in the present study 
(Eysenck & Prell, 1951). 

The birth records for five boroughs in south London were 
searched for all twins of the same sex, born during the period 
1935-1937. The reason for selecting only like-sex twins was that 
identical twins of necessity belong to the same sex. If then, fraternal 
twins of the opposite sexes had been included, they would have 
introduced a possible complication due to sex differences. Con- 
cerning the age limits, the lower limit was set because children 
younger than 11 could not have taken all of the tests; the upper 
limit was set because a wide age-range necessitates statistical 
corrections for age which complicate the picture. 3 

The survey of the birth records yielded the names of 130 pairs 
of like-sex twins. From these it was possible to locate 68 pairs who 
were living in the London area, close enough to be able to attend 
the Psychological Laboratory of the Institute of Psychiatry. The 
remainder were either living too far away, had died, or could not 
be located. In no case was parental permission to test refused. 
The twins were examined as they were located. After examination 
they were classified as identical or fraternal according to a pro- 
cedure to be described presently. 

Although there is no longer any doubt of the existence of two 
types of twins, efficient criteria are needed in order to effect a valid 
separation in all cases. Two methods of diagnosis have been used 
to group identical and fraternal twins: the foctal-membrane method 
and the similarity method. In view of the many criticisms made of 
the former, the latter was employed. : 

The similarity method involves the comparison of the members 
of a pair of twins in respect to numerous physical characteristics 
which are determined by heredity. As the number of characteristics 
is increased arithmetically, the chances of any two siblings notbeing 
alike on all the characteristics is increased geometrically. There- 
fore, if the chances of two children in the same family being alike 
in one such characteristic is one in two, the chances of their being 


alike in ten is one in one thousand. 
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: ten ich the 
In the present investigation the set of criteria upon which 5 
diagnosis of zygosity was made is as follows: 


(1) Close resemblance of ears, teeth and facial features. ! 

(2) Iris pigmentation. 

(3) Standing height. 

(4) Presence or absence of mid-digital hair. 

(5) Ability to taste phenyl-thio-carbamide. 

6) Scapular shape. 2 ell 

z To (а Blood groups A,A,BO, Rh, MNS, P, Lewis, K 
and Lutheran. 


; es was 
(1) Close resemblance of ears, teeth, and facial shi: pro- 
rated on a three-point scale; no resemblance, differen ery Pro- 
nounced resemblance, but slight differences (SD); ди tinguis 
nounced resemblance rendering it almost impossible to dis 
the twins, same (S). > a three- 
(2) The resemblance of the iris pigmentation was rated on þlancê, 
point scale; no resemblance, different (D); pronounced esemi н 
but slight differences in one zone (SD);! and very PO AB his 
resemblance, rendering it almost impossible to disting 
twins, same (S). 
(3) The standing 
quarter of an inch. 
* (4) Presense or absen 
region of the fi 
half clenched 


t 
eares 
height of each twin was measured tothe n ! 


: t 1 ital D 
ce of hair on the dorsum of the mid-dig 


electric light p 
of any hair on 


(5) Ability to taste bhenyl-thio- 
the subject drink one-quarter 


1Tn the majority of individuals, the iris is composed of two zones, an inne? 
and an outer óf different pigmentation. In fifty pairs of siblings studied by Rife 
(1943), no two pairs were found to have the Same iris pigmentation in rega” 
to both zones, 


although in ten pairs the outside zones were the same, and 10 
four pairs the inside zones were the same. 2 
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type included any combination of the other three: (CC-CV), 
(CV-S), etc. 

Grouping the Twins into MZ and DZ. Using the data on the 14 
criteria, the twins were first classified into one of three groups: 
definite- MZ, definite-DZ, and doubtful. A pair of twins was considered 
definite-MZ if they had (5) in criteria (1) and (2); differed less 
than 1:5 inches in height on criteria (3);* and if the children agreed 
exactly on criteria (4) to (14) inclusive. Twins were considered as 
definite-DZ if they were rated (D) on both criteria (т) and (2); 
or if their height differed by more than 3:5 inches, criterion (3); 
or if they differed on any one of criteria (4) to (14) inclusive. Twins, , 
were considered doubtful if they had ratings of (SD) on both criteria 
(1) and (2), or a rating of (SD) on either (1) or (2) and a rating of 
(S) on the other; if they differed 3:5 inches or less on criteria (3); 
and if they agreed exactly on criteria (4) to (14) inclusive. 

On the basis of the above procedure, 20 pairs of twins were 
classified as definite- MZ, 24 pairs as definite-DZ, and 6 pairs as 
doubtful. : 

When a pair of twins had been rated as doubtful, the blood 
groups of both parents were ascertained in order to effect a final 
classification.? The blood groups of the twins and their parents 
were then compared to determine the chances of the blood groups 
of two siblings from a known mating being alike on all of the blood 
groups. In five of the doubtful pairs, the chances of the two children 
being alike on all.of the blood groups, taking into account the 
blood groups of their parents’ blood, were: 1:256, 1:256, 1:1000, 
1:1000, and 1:2000. Accordingly, these pairs of twins were classified 
MZ. In the other doubtful pair the chances were: 1:16, therefore 
this set was classified DZ. 

The research design calls for a criterion group of neurotic chil- 
dren against which the factor extracted from the normal twins 
could be validated. Twenty-one children born between 1935 and 
1937 were selected from out-patients at the Maudsley Child 


1 Newman, Freeman, & Holzinger (1937) found that 94 per cent of their 50 
sets of MZ twins had pair differences in standing height of less than 1:5 inches; 
53:8 per cent of their 50 pairs of DZ twins had pair differences of less than 
1:5 inches. None of their MZ twins had a pair difference of over 3:1 inches; 
whereas 19:1 per cent of their DZ twins had a pair difference of over 3:1 
inches. E 
? Blood group tests were not made for the parents of all 50 pairs of twins as 
it was feared that an attempt to persuade the parents to submit to a *blood-test? 
might have resulted in a loss of the co-operation already secured. 
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Guidance Clinic, Great care was taken to exclude — M 
organic complications, or with possible psychotic in ine nim 
Were not definitely-considered ‘unstable’ by the exam roaches 
chiatrist. The resulting sample of twenty-one children ce wledge - 
as closely as is possible at the present stage of psychiatric к; other - 
a ‘pure’ neurotic group, with relatively little mixture 
mental or physical disorders, . Бие 
The following tests were used in this study: it should 2) sm - 
bered that the choice was made before our knowledge of thi 


tter 
: She uch be 
` was as extensive as it is now, and that consequently a m 


a 


: n 
A : ble whe 
choice could be made at the present time than was possi 


this experiment was being planned. 


Ре igit 
(1) Intelligence (Wechsler-Bellevue, Similarities and DIE | 
Symbol sub-tests). 
(2) Tapping Area (cf. рг 133). Ї 
(3) Tapping Speed (cf. photograph 22). 
(4) Level of Aspiration. h 12). 
(5) Motor Speed Test (Track Tracer, cf. photograp cards 
(6) Speed of Decision (choice of ‘higher’ of two playing h 21) | 
i lying face downwards on the table; cf. photograp? * 
(7) Static Ataxia. 
(8) Body-Sway Suggestibility, 
(9) Dynamometer Strength of Grip. 
(10) Word Dislikes, 
(11) Brown Personality Inventory (Adapted). 
(12) Lie Scale (adapted from M.M.P.I). 
(13) Flicker Fusion, 
(14) Autokinetic 
(15) Autokinetic 
(16) Speed of үү 
(17) Fluency. 


Movement, 
Suggestibility, 
riting S?s backwards, 


is ? urations r three 
significant factors, which leave onl insignifi use ын is AS 
given in Table XXXI are the correlati residuals. 
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criterion ! (these are biserial correlations; ай others are product 
moment correlations). Items 13 has been omitted from further 
calculations as it was impossible to give this test to the neurotic 
group. In order to rotate the factors so as to obtain maximum 
correlation with the criterion, those tests which showed negative 
correlations with the criterion were multiplied by — т as indicated 
in Table XXXI; this is merely a device which reverses the direc- 
tion of scoring of the tests affected, and leaves the data unchanged 
in any material way. The last column of Table XXXI gives the 
rotated factors. The new factor 1 correlates -758 with the criterion 
column, having been rotated into maximum agreement with it. 
Factor II was then rotated in such a manner as to preserve 
orthogonality with Factor I, and to take up all the remaining 
variance on the intelligence test. Factor III is irrelevant to our 
purpose, and no interpretation of it will be attempted. Interpreta- 
tions of Factors I and II are:straightforward and dictated by the 
results: Factor I is a factor of neuroticism, Factor II one of 
intelligence. 4 
ans and variances for the neurotic 
d the fraternal twins, on all 17 tests 
т the ‘neuroticism’ factor. The first 


ын у difference, at the 2 per cent level 
are more variable than uen Gaerne а 
are re i int t 
ЇНЭН ры between Toma нав: 
Table XXXIIL) each tost with this dichotomy, which is given in 
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B 5 E: e А 
ns variance is larger; two-of these differences are significant at 
es 2 per cent level (Static Ataxia and Neuroticism Factor Score). 

here seems to be little doubt that in our sample identical twins 


and fraternal twins have almost identical means with respect to 


neuroticism (23:20 and 22:96 respectively),! but that the identical 
Бар contains pairs of twins tending to be more extremely un- 
Sta по and more extremely stable, than the fraternal twin group. 
n able XXXIII shows the raw intraclass correlations between. 
wins of both types, correlations with age, and the partial correla- 
tions resulting from eliminating age differences. It will be seen that 
age plays little part in determining scores, and that correction 
leaves the correlations very much as they were before. The last 


column of Table XXXIII gives Holzinger's 4? values, i.e. the 


extent to which each test variance is determined by heredity. It 
will be seen that the test of intelligence has an 4? (:676) which 
is almost identical with the 4? values given by the best neuroti- 
cism tests: static ataxia (-692), autokinetic movement (648) and 
Suggestibility (+701). Е 

As the last step in this procedure, factor scores on the neuroticism - 
factor were calculated for each twin and 4? values calculated for 
these ‘neuroticism scores’. 
п en 

22 

1-0) (1 — n) 
Where r; — intraclass correlation for identical twins and ry = intra- 
class correlation for fraternal twins. It follows that 


(I=) = 


h? = 


‘within’ i variance _ 13:680 _ Хао Бой шей 
‘total’ 7 variance 91°551 
‘within’ f variance _ 32:480 
== 1 1 " 9 = - = 
ршн т *total f variance 41:468 


= +783, from whicli ту = :217. It follows that 4? = 4810. ‚ 
This value is considerably higher than that given by any single 
test, and indicates that the factor constitutes а biological unit 
which is inherited as a wholc. 3 

The A? technique used in 


this paper gives the percentage of 
butable to nature providing that 


1 These mean scores of the normal children should be compared with the 


mean score for neurotic children ( 13:34). There is relatively little overlap 
between the two distributions of scores, and the difference between the normal 


and neurotic group means is significant at a very high level. 
"i. t 
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certain assumptions are met. Two of these assumptions are that 
nurture influences are the same for both types of twins, and that 
differences due to nature are uncorrelated with differences due to 
nurture. It is probable that these assumptions are not completely 
met, and that consequently our estimate is too high. On the other 
hand, another assumption, viz. that the variance due to errors of 
measurement is negligible, is quite certainly not fulfilled, and in 
view of the known unreliability of personality tests we must assume 
that errors of measurement may play a considerable part. This 
would lead one to believe that the found h* would be an under- 
estimate of the true value, and it seems not impossible that this 
factor may cancel out the two previously mentioned. It is perhaps 
permissible, therefore, to argue that the 4? found is a rough and 
ready estimate of the contribution which heredity makes to 
individual differences in neuroticism.! 

Our conclusion regarding the size of this contribution is not in 
agreement with that arrived at by Newman, Freeman, & Holzinger 
(1937). Where they find that ‘the only group of traits in*which 
identical twins are not much more alike consists of those commonly 
classed under the heading of personality . . .’, we have shown that 
identical twins show a correlation on neuroticism of -851, while 
fraternal twins show a correlation of only :217. From this it was 
concluded that individual differences with respect to neuroticism, 
stability, integration, or whatever we may wish to call this trait or 
factor, are determined to a very marked extent by heredity, and 
very much less markedly by environment. This conclusion, of 
course, applies only in the general type of environment from which ~ 
all our twins came, and might not be applicable under conditions 
of more extreme environmental variation such as may obtain in 


x 


ed how strong an influence heredity exerts 


in at least one field of personality, we cannot close this chapter 
without drawing attention to certain important theoretical points 
which would seem to be directly related to this demonstration. 
These points arise largely out of what to us appears a somewhat 
unhealthy tendency in current psychological literature to repeat 
in the field of affection and conation the errors and misinterpreta- 
tions which vitiated the early work on intelligence. We are referring, 


other cultures. 
Having demonstrat 


of course, to the unconquerable tendency among certain writers to 


1 We have used Holzinger's A? statistic in our estimates because no better 
estimate of the contribution of heredity to the total variance is available. 
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interpret their results in terms either of hereditary or environ- 
mental influences, although the methodology of their research does 
not permit any conclusions to be reached on this point. We have - 
learned through the lost labours of many years that an experi- 
mental observation to the effect that dull parents have dull chil .. 
dren is irrelevant to the question: To what extent are intellectual 
differences innate? We know now—as we should always have 
realized—that both hereditary and environmental causes may 
with equal facility be invoked to explain the observation. But we 
do not appear to have learned to use ‘transfer of training’ and 
_ apply this lesson to other fields than the intellectual. The literature t 
- is replete with papers in which the fact that trait A in a group of 
children is found to be correlated with trait X in their mothers 15 а — 
· used as an argument to the effect that the behaviour characterized — 
Е by trait X is directly causative of the behaviour characterized by 
, trait A. No appeal to the elementary text-book principle that. 
correlation does not and cannot prove a direct causal relation appcats 
sufficient to stem this flood of environmentalistic interpretation, 
and even otherwise critical reviews often fail to point out the fact 
that both trait A and trait X may be due to a third variable, ' 
namely, an inherited tendency to behave in this manner, and may 
E be completely unaffected by environmental manifestations. 
E Such neglect of caution in interpreting results could. perhaps, 
... according to current psychiatric thinking, be traced back to 
. emotional causes and tie-ups which the present writer 15 not 
competent to disentangle. It is perhaps significant that both 
> кие qe e divergent as their paths per 
r to be, are at one in this res И n the 
apparent differences between the democratic ideals of the САГ 
and the U.K., and the dialectical authoritarianism which cha! 
acterizes the ‘dictatorship of the proletariat’ in the U.S.S.R. melt 
2 Бр. e U.S.S.R. 
away before this belief in the ‘equality of man’. Slater (1950) has 
put this criticism into particularly strong, but entirely justifiable - 


terms. "There has . . . beenanincreasi ici 
prim : ng tend, inicians 
to minimize the effects attributable. i gen. 
teach a psychiatry in which they receive little or auses, "Th 
tendency has been marked in Britain, but ith deos id- 
able strength in the U.S.A. Instead of on assume d рэн 
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апа more exclusively towards psychotherapy, psychoanalysis, social 
psychiatry, personnel selection, group therapy, and preoccupa- 
tions with anthropology, sociology, and political theory. In its 
one-sidedness, this development is not healthy. , 

_ It is a sign of bad omen that it is possible for text-books of 
clinical psychiatry to appear, with claims for comprehensiveness, 
in which no mention is made of the established facts of genetics and 
of the hereditary element in mental disorder. Their authors appear 
to feel, though in fact this view depends on a misapprehension, that 
recognition of a hereditary factor implies a. therapeutic nihilism; Я 
апа that an energetic and optimistic attitude towards treatment ? 
calls for a neglect of hereditary factors, just as a due appreciation. 
of the patient as an individual demands forgetfulness of nosological 
entities. One suspects that the prime motivation is derived from . 
the philosophy of Dewey, which is no doubt over-simplified in the 
notion that one should accept as true that which. has convenient 
practical applications. | 

‘It would not perhaps be putting it too high to say thatewe are 


; witnessing the manifestation of an anti-scientific tendency which 
is winning an increasing number of supporters. The customary 


canons of scientific reasoning are ignored by these schools. Uncom- 
fortable facts are left unconsidered. Hypotheses are multiplied — - 
regardless of the principle of economy. Explanations which may |. - 
be valid for certain members of a class of phenomena are regarded . pe 
as true for the class as a whole. Interpretations which conform 
with theory, and which might be true, are regarded as established. 
Possible alternatives are not considered, and no attempt is made — 
to seek for evidence of critical value which shall decide between 
them. Criticisms from outside are ignored, and only the initiate 
may be heard. Utterance is dogmatic and arrogant, and lacks 
scientific humility and caution. These are the mental mechanisms 
which we associate with the growth of a religious orthodoxy, and ss -< 
not with the progress of science. The movement is of significance 
to genetics, because it is likely adversely to affect the personnel 
and facilities for research, and to lead to a psychiatry without 
biological foundation and divorced from contact with the other 
natural sciences.2_ 
We may perhaps with some advantage analyse in some detail 
one particular study to see how this particular idol of the market- 
"place can find its way into the conclusion of what is otherwise an 
admirable > eriment. - w 1 Š 
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'The experiment chosen for this purpose may justifiably claim 
to be the first attempt to apply’ factor analysis to psychoanalytic 
hypotheses; indeed, apart from serving as our demonstration model 
the work to be discussed has many claims to be included in this 
volume for the sake of the positive contribution it has to make. It 
would have been easy to find a much more obvious and vulnerable 
piece of work; we have chosen on purpose to deal rather with а 
particularly subtle and well-controlled experiment. А 

The study under review deals with the influence of breast feed- 
ing on character-formation (Goldman-Eisler, 1948, 1950; 1951). i 
The very first sentence sets the pattern: ‘The description ofa ul үй 
character in terms of childhood experience is one of the basic 
principles of psychoanalytic characterology, and indeed the genetic 
approach (in the sense of ontogenetic, or referring tò childhoo! 
experience) to human personality is the essence of the theory ап 
method of psychoanalysis.’ The particular hypothesis chosen че 
this experiment is concerned with oral character traits and the? 
origin,"as outlined more particularly by Karl Abraham (1916, 
1924, 1942) and E. Glover (1925). Oral character traits ате believe 
to originate from repressed or deflected oral impulses which are 
dominant during the nursing period and which have undergone 
transformation into certain permanent behaviour patterns by the = 
processes of reaction-formation, displacement; or sublimation. Two 
main syndromes are posited by these writers to emerge from the 
experiences of gratification or frustration attached to the oral stage 
of libido development. One of these is the orally gratified type 
(late weaning), who is supposed to be distinguished by an UP" 
perturbable optimism, Benerosity, bright and sociable conduct, 


accessibility to new ideas, and ambition accompanied by sanguine 
expectation. The other, i 


; : Че orally ungratifie weaning, 
equis teraed by a profoundly по ack uet some" 
times accompanied by moods of depression and attitudes of with? 
drawal, a passive-receptive attitude, a feelin of in: ity, a nee 
for assurance, a grudging feeling of injustic 5 security, а n 


We thus have two hypotheses: (1 
: : Р : I 
relate together in a well-defined Хо ) Gertain traits tend to СО 
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TABLE XXXIV 


(Reliability coefficients are presented in parentheses following the descrip- 
tion of each trait.) 

Optimism: Optimism and pessimism are antithetical expressions in char- 
acter-formation of an omnipotent and magical relation to reality. Optimism 
deriving from this unconscious source would be upheld by the individual 
against reasonable expectation (r = -go). ў 

Pessimism: Inability to accept frustrating experiences as part of reality. 
Defence against disappointments through summary and advance resignation, 
anticipation of disappointment (r — -74). 5 
t 407 Passive-receptive attitude, hedonism, indolence, self-indulgence 

r= 81). 

Desire for the Unattainable: Intense desire to climb combined with a feeling 
of unattainability, of difficulty in achievement, of the insuperable, grudging 
incapacity to get on (r — -78). 

Displaced Oral Aggression (Verbal): Aggressive use of speech, ‘omnipotent 
valuation of speech’, ‘incisive speech’. 

Aggression: The desire to injure or inflict pain, to overcome opposition 
DIR to fight, to revenge an injury, to attack, to oppose forcefully 

т = 378). 
s Aloofness: Negative tropism for people, attitude of rejection (r = +73). 

Ambition: 'Тепдепсу to overcome obstacles, to exercise power, to strive to do 
something difficult as well and as quickly as possible, to attain a high standard, 
to excel one's self (r — -68). : 

Autonomy: Those who wish neither to lead nor to be led, those who want 
to go their own way, uninfluenced and uncoerced. Independence of attitude 
(r = :88).- 

v. Dependence: Tendency to cry, plead or ask for nourishment, love, protection 
or aid, Helplessness, insecurity (r = +56). 

Guilt: ‘Conscience’, inhibiting and punishing images, self-torture, self- 
abasement (r = -go). ; 

Change: A tendency to move and wander, to have no fixed habitation, to 
seek new friends, to adopt new fashions, to change one's interests and vocation. 
Inconsistency and instability (r — -71). : 

Gonservatism: Adherence to certain places, people, and modes of conduct. 
Fixation and limitation. Enduring sentiments and loyalties, persistence of pur- 
pose; consistency of conduct; rigidity of habits (r = -56). n. 

Impulsion: The tendency to act quickly without reflection. Short reaction time, ' 
intuitive or emotional decisions. The inability to inhibit an impulse (r = :82). 

Deliberation: Inhibition, hesitation and reflection before action. Slow 
reaction time, compulsive thinking (r = +87). 

Exocathexis: The positive cathexis in pra 
undertakings. Occupation with outer events: 
occurrences. A strong inclination to participat 
affairs (r = +72). ¢ 

Endocathexis: The cathexis of thought or emotion for its own sake. A pre- 
occupation with inner activities, feelings, fantasies, generalizations, theoretical 
reflections, artistic conceptions, religious ideas; withdrawal from practical life 
Y ram Tendency to nourish, aid, and protect a helpless object. To 
express sympathy, to mother a child, to assist in danger (r = 5707. 

Sociability: Tendency to form friendships and associations; to join, and live 
with, others. To co-operate and converse socially with others. To join groups 


(r = 79 5 
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~ adults constituted the experimental population; for all of these 
ad - statements from their mothers regarding process and time of wean- 
“ing were available. Nineteen scales, mainly of a questionnaire 
type, were constructed or modified from existing scales, to measure 
~ traits which according to the hypothesis under investigation were 
> ~ related to ‘orality’. Table XX XIV gives the titles of these scales, 
- together with a split-half reliability coefficient for each scale. ( The 
=- average number of items in each scale was 8; average reliability 
~~ was 76.) The full scales are published elsewhere (Goldman-Eisler, 
~ ^ 1951). 
E. 19 scales were intercorrelated, and the resulting table is 
“given below (Table XXXV). A factor analysis was carried out, 
and the saturations for two factors are reported in Table XXXVI. 
It will be seen that the first factor does indeed resemble the hypo- 
thetical ‘oral’ type—aloofness, pessimism, endocathexis, conserva- 
tism, deliberation, guilt, and aggression forming the ‘ungratified 
type, and exocathexis, optimism, nurturance, sociability, change, 
and impulsion forming the ‘gratified’ type. So far, then, the 
hypothesis appears to be confirmed. (We shall not here discuss 
_ the interpretation of the second factor, as this would take us too 
far afield. The interested reader is referred to the original paper 
(Goldman-Eisler, 1951).) 
5 _ We must now turn to the second 
with early or late weanin 
on ‘early weaners’ and ‘late weaners’ 


3 defining these in terms of 
of age, and not earlier than 
educed the number of cases 
ompared ‘early weaners', as 
ts’, defined as having been 
of age. Scores for this 
- The analyses are given 

Т. It will be seen that in 
d direction, and that in 
r cent level. The correla- 
ed orality’ is 271 accord- 
ng to the second analysis. 
the correlation between 
nclude ‘early weaning’ in 
ration with the first factor. 
thus confirming the previous result which 
pproximately -3 between oral type and 
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TABLE XXXVI 


^ | Factor Saturations Factor Saturations 


i а ss Trait =ч 
Ттай i T E | 1 n 

Optimism — 680 —298 | Guilt 208 —352 
Pessimism = 542 —041 | Change | —272 267 
Passivity 376 | 214 | Conservatism 370 —436 
Unattainable 050 | 209 Impulsion — 104 761 
Oral Aggression —150"! 804 Deliberation 20 шы 188 
Aggression 209 | 479  Exocathexis , —678 | —31 
Aloofness 750 234 | Endocathexis 477 160 
Ambition —192 ^ 153 | Nurturance —573 | —201 
Autonomy 156 | | 560 | Sociability —499 | —903! 
Dependence 245 | —296 | 


TABLE XXXVII 


; ———À 
Source Degrees of Sum of Mean 


Freedom Squares Square 
Within weaning group 88 22-267 :253 
Between early and late weaning group | I 2:033 2:033 
| 89 24:300 2730 
Е = 8:04 Epsilon = -271 
TABLE XXXVIII 
- - —————— EU ounces ——- —— — 7 —_—— 
5 Degrees of Sumof | Mean 
т | Freedom ped | Square 
Within weaning group | 64 | 16:271 254 
Between early and very late weaning 
group | 1 гдоф | 1 :904 
| 65 18-175 | :280 


POS RR 


weaning. We might thus conclude that both hypotheses had been 

+ Verified; we have found the hypothetical type to exist roughly 4s 
posited, and we have found this type to be related to weaning in 
the predicted direction at a high level of confidence, It is at this 
point that we must bring our criticism to bear. 
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Let us first of all look closely at the traits characterizing the 
oral type. Comparison with results of factorial analyses published 
elsewhere in this volume will show that the structure of traits found 
here is very similar to that which has given rise to the hypothesis 
of an introvert-extravert dichotomy; the results of the factorial 
analysis agree with this interpretation at least as well as they do 
with that made by Goldman-Eisler. This, of course, is not entirely 
unexpected; when experienced psychologists and psychoanalysts 
set down syndromes of trait resemblances which they have observed 
in their experience it is not unlikely that they should agree in their 
observations, even though they may give different names to the 
syndromes thus isolated. It is important, though, to remember that 
for Jung and his followers extraversion or introversion are largely 
determined by constitution, while for Freud and his followers the 
‘oral’ type is determined entirely in terms of early childhood 
experience. Thus the results of the factorial study do not confirm 
Freud any more than they confirm Jung; as far as the crucial 
difference between them is concerned the results are neutral. 

What can we say of the correlation of this typology with wean- 
ing? Surely the existence of a correlation as small as :3—even 
though it be fully significant—cannot under any circumstances be 
given a causal interpretation. It is equally plausible to suggest that 
introverted mothers tend to have introverted children by the action 
of genetic factors; that introverted mothers tend to wean their 
children earlier because of their lack of ‘exocathexis’; and that 
consequently there will arise a correlation between early wean- 
ing and child's introversion which can all too easily mislead the, 
investigator to believe in some principle of direct causation. 

There is an obvious objection to this line of argument. It may 
be maintained that in criticizing the experiment we have left out a 
crucial part. Did not the investigator set out to test the predicted 
consequences of a clearly formulated theory? What more can a 
single experiment do than to verify such a deduction? Is not too 
high a standard being applied to experiments of this kind, a 
standard which contradicts the logic of scientific methodology? 

The answer to this objection is very simple. The deduction 
from a theory the testing of which may serve as а confirmation of 
the theory must refer to phenomena which themselves are as yet 
unknown. This essential requirement is not fulfilled in the present 
case. Abraham and Glover based their hypothesis on the observa- 
tion that among their patients there appeared to be a close con- 
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comitance between personality and certain weaning practices in 4 
their childhood. From this observation they erected the hypothesis — 
of the oral type. But clearly this hypothesis cannot be tested simply 
by confirming the original observation of a correlation between 
oral type and weaning! To test it would require a deduction 
different from the original observation. Such a deduction should 
preferably be made in such a way as to provide a crucial experi- 
ment between the two rival hypotheses outlined above. In the 
absence of such an experiment, all that the Goldman-Eisler study 
has done is to confirm in a very impressive manner Abraham’s and 
Glover’s original observation of a correlation, The interpretation 
of that correlation is not affected in any way by her experiment, 
‘and it would be indicative of ‘environmentalist’ prejudice to inter- 
pret it as favouring the Freudian view, just as it would be evidence 


^. ofa ‘hereditarian’ prejudice to interpret it as favouring the Jungian 
view. 


>»? 


+ 


x t* It is because the former prejudice is so widespread and so 
«э insidious that we have included this extended discussion here: 
~ Implicit assumptions which govern interpretation of data in the 
: ini "absence of factual evidence may be far more harmful to a science 
than explicit errors which can easily be put right. The environ- 
mentalist assumption underlies the greater part of contemporary 
work in the field of personality, and still determines interpretation 
of strictly neutral material in the complete absence of confirmatory 
evidence. This does not mean that the environmentalist hypothesis 
; 5 necessarily false; we have no knowledge on which to base an 
opinion. The need, as always, is for well designed experiments 
which will enable us to cease arguing, and substitute fact for 
interpretation. 


Chapter Six 


THE PSYCHOTIC DIMENSION 


N Dimensions of Personality the writer has discussed briefly the 
I: relationship between the factor of extraversion-introver- 
sion isolated there, and the schizothymia-cyclothymia typology , 
advocated by Kretschmer. It was felt that the facile equation of 
the introvert with the schizothyme, which many writers in the 
field of personality have assumed as a fact, could not be maintained — 
in view of the explicit statements by Jung ( (1923) and Kretschmer í 
(1948) regarding the position of thë hysteric in their крыше, 9 
“typologies. For Jung, the hysteric is a prototype of the extrovert; - 


for Kretschmer, the hysteric has close affinities to the schizophrenic “232 


and belongs, therefore, quite definitely to the schizothyme group. — 
As it is clearly impossible for the hysteric to be both extroverted. 
and schizothymic; if we equate introversion with schizothymia, it 

is impossible to regard these two typologies as identical. In view of 
the absence of independent evidence on this point, no positive sug- 
_gestions were made regarding the position,of Kretschmerian types 
in our own dimensional system of personality. However, clearly, 
this was an unsatisfactory position and an experimental integra- 
tion of the two еи became a priority in our experimental 
programme. 

Another weitem arose in this connection which also demanded _ 

an empirical answer. As will be shown, Kretschmer's hypothesis of 

a cyclothymia-schizothymia dimension of personality is dependent 
on the assumption of an essential continuity between normal and 
psychotic mental states; in other words, he postulates a normal- 
psychotic continuum, or a factor of ‘psychoticism’. In the same _ 
way, Jung implies a factor of ‘neuroticism’ in making neurotic А 
syndromes prototypes of his normal typology. The question arises 
whether these two dimensions of neuroticism and psychoticism are 
identical (assuming they both exist), or whether they must be 
regarded as clearly separate and unrelated. Existing theories are 
not at all helpful on this point. Freud, who posits a general factor 
of regression, would identify the neuroticisri and psychoticism 
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factors, describing the psychotic as one who has regressed most 
severely, and the neurotic as one who has regressed less severely; 
the normal person, presumably, showing signs of regression. Most 
orthodox psychiatrists seem to assume that neuroticism and psy- 
choticism are situated along two unrelated dimensions; however, 
their actual practice in diagnosing patients does not seem to follow 
their theoretical point of view. A patient is diagnosed, for instance, 
either as a schizophrenic or as an hysteric. If these two labels are 
considered to lie on orthogonal axes, there is no reason why а 
patient should not be diagnosed as suffering from both these dis- 
orders, just as a patient might be an hysteric and intelligent, or a 

_ schizophrenic and tall. This current procedure could only be 
defended if we assumed that psychotic and neurotic states were 
qualitatively different from the normal, a belief held more fre- 

. quently with respect to psychotics than with respect to neurotics. 
It will be clear that the fielá is in a somewhat chaotic state and 
that not only do different writers flatly contradict each other, but 
that tlie same author at different times will follow different and 
„often contradictory hypotheses. Again an empirical clarification is 
essential. 

This chapter, then, is devoted to a description of the Kretsch- ` 
merian system and to a report on our experimental results, which 
were sufficiently clear-cut to permit at least a preliminary and 
PE qs interpretation. The account of Kretschmer's system 
fist glance; the reasons for tis дееп зу арааг ary аї 
In the Ent place, Kretsch pet саге description аге threcfold. 
RUFUS Vd Pl: mer's system is known to the Anglo- 

хс : rely through the translation ofa very early 
edition of his book; the important revisions and the large amount 
of experimental evidence accumulated since by Kretschmer and 
his students is hardly ever mentioned or taken into account. This 
means, as the writer has shown elsewhere (19500, 1951), that critics 
are still busy demolishing a position which Kretschmer himself 
left about thirty years ago, and fail to appreciate the important 
advances which he has made since. An up-to-date exposition of 
Kretschmer’s system, therefore, seems an absolute necessity for an 

adequate understanding of the problem involved. 
я їш Ae second place, Kretschmer has been dealing for a long 
me wi methodological problems in the field of typology which 
ae uu р шге ш connection with the general aim of this 
nd which have again hardly received the attention which 
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they deserve, although the writer does not believe that the methods 
which Kretschmer suggested and used in the 1920's are as powerful 
and convincing as the factorial methods introduced since. They 
are of great interest and importance, nevertheless, and deserve 
more critical appreciation and discussion than they have received 
hitherto. 

In the third place, Kretschmer and his students have originated 
subsidiary hypotheses and have constructed tests to investigate 
these hypotheses, which appear to the writer far in advance of 
most of the work being carried out in the field of personality 
research in other countries, using much more sophisticated statis- | 
tical methods. Indeed, to anyone surveying personality research _ 
dispassionately, it will be only too obvious that there exists a rather ` 
high negative correlation between, on the one hand, psychological 
insight, astute framing of hypotheses, and general understand- 
ing of personality structure, and, on the other hand, statistical - 
sophistication, ability in the construction of experimental tests for 
the testing of hypotheses, and expert knowledge of methods of 
ascertaining validity and reliability. Kretschmer, although his 
approach is deficient in the elaborate statistics which have recently 
been the vogue, nevertheless combines these two indispensable 
sides of personality research to an unusual degree, and his work 
deserves to be better known than it is..It is not good for science 
that a stereotyped view of a man’s contribution should obscure 
aspects of that contribution which do not fit in with the stereotype. 
It is in the hope of destroying certain erroncous notions and aiding 
a better appreciation of Kretschmer's real contribution that the 
first part of this chapter has been written. 

Kretschmer’s system is a typology, but it would be very wrong 
to imagine that his conception of ‘type’ is similar to that so 
frequently criticized in elementary text-books. In this simplified 
kind of presentation, the concept of ‘type’ is often contrasted with 
the concept of ‘trait? and it is suggested that the ‘type’ approach 
implies a bimodal type of distribution, whereas the ‘trait’ approach . 
implies a unimodal type of distribution. The argument is often 
advanced that as most human characteristics can be shown to vary 
unimodally, the ‘type’ approach must be erroneous. This argu- 
ment is, of course, fallacious, as the observed distribution of scores 
on a test which has no rational metric underlying it has no neces- 
sary connection with the true distribution of scores. All we have 
available with psychological tests are the observed distributions, 
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' ofc urse, and to argue from those to the ‘true’ distributions is not 
. admissible. 
s In any case, regardless of the validity of the argument from 
~ © distribution, the concept of ‘type’ as being essentially bimodal is 
not in accord with Kretschmer’s own definition (or, indeed, with 
that of Jung or most other Continental typologists). Kretschmer 
makes his meaning quite clear. “The concept of type is the most 
important fundamental concept of all biology. Nature . . . doesnot 
work with sharp contrasts and precise definitions, which derive 
from our own thought and our own need for comprehension. In 
. nature, fluid transitions are the rule, but it would not be true to 
say that, in this infinite sea of fluid empirical forms, nothing clear 
and objective could be seen; quite on the contrary. In certain 
fields, groupings arise which we encounter again and again; when 
we study them objectively, we realize that we are dealing here with 
_ focal points of frequently occurring groups of characteristics, con- 
. centrations of correlated traits... . What is essential in biology, 
as in clinical medicine, is not a single correlation, but groups of 
correlations; only those lead to the innermost connections. It is 
daily experience in the field of typology, which can be deduced 
quite easily from the general theory, that in dealing with groups of 
characteristics one obtains higher correlations than with single 
characteristics. ‚.. What.we call, mathematically, focal points of 
statistical correlations, we call, in more descriptive prose, con- 
stitutional types. The two are identical; it is only the point of view 
which differs... . A true type can be recognized by the fact that 
it leads to ever more connections of biological importance. Where 
there are many and ever new correlations with fundamental bio- 
logical factors, for instance, in the constitutional types dealt with 
here, we are dealing with focal points of the greatest importance.’ 
It will be seen that the conception of type which Kretschmer has 
elaborated here is very similar to the one given by the writer, who 
regards types as ‘observed constellations or syndromes of traits’, 
and traits as ‘observed constellations of individual action tend- 
encies’ (1947). This use of the term type, which is not far removed 
from the conception discussed by Murphy and Jensen (1932), 
appears to be free from the defects which the concept of type is 
often alleged to possess. 


The particular system of correlations which Kretschmer chooses: 
85 à starting point lies in the constitutional field, i.e. in the field of 
body types, where he contrasts, as is well known, the pyknic and 
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the leptosomatic types, with a third one, the athletic, cone deeds д 
sometimes intermediate and sometimes definitely divergent f from- 
the other two. In addition, he has dysplastic types and a certain 1pro- TA 
portion of unassignable doubtfuls. While Kretschmer's approach ` 
is not exactly along statistical lines, a number of factorial studies 
of intercorrelations between bodily dimensions, reviewed by the 
writer (Eysenck, 1947), have shown that essentially there is a main 
dichotomy corresponding closely to his pyknic-leptosomatic type, 
and there can, therefore, be little doubt that this first step in his 
typology is eminently sound, although it may be suggested that 
greater statistical sophistication might have enabled him to purify | _ 
his concepts rather more, and also to arrive at better indices of 
body build than those elaborated in his book. 

The position of the athletic type raises important points here, 
a consideration of which cannot be deferred. Kretschmer and 
Enke (1936) consider the athletic type as being essentially different - __ 
from both the others rather than as being intermediate between 
them. This position appears unacceptable to us. In the first place, 
there is no independent statistical proof, such as only factor analysis 
could supply: in our own work we found no evidence of a factor of 
this kind (Eysenck, 1947). In the second place, in the large body 
of experimental work in which Kretschmer and his followers have 
tried to differentiate the three body types with respect to psycho- 
logical functions, it will be found in almost every case that the 
athletics are intermediate between pyknics and the leptosomatics, 
though somewhat closer to the latter. This suggests very strongly 
that they are not in a separate group, but are, in truth, inter- 
mediate. The attempt of Kretschmer and Enke to create a third 
type from the athletics often leads to rather disingenuous argu- 
ments. Finding usually that leptosomatics and athletics differ very 
little in their test results and are opposed to the pyknics, they have 
to argue that the causes which lead to the test results of lepto- 
somatics are different from those which lead to the test results 
of the athletics, although no evidence of any kind is given to sup- 
port these ad hoc arguments. The reader will be able to judge 
this point from the experiments quoted elsewhere (Eysenck, 
1951), and detailed data are given there for all three groups 
separately. 

If the pyknic-leptosomatic typology is not only justified, but 
also fruitful, we should expect to find a large number of correla- 
tions of psychological importance. However, it may be noted that 


. points made by Kretschmer—i.e. the prev 
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“there are also a large number of physiological correlations, par- 
ticularly in the field ofautonomic functioning and of endocrinology, 
which deserve some mention; these have been summarized briefly 
by the writer elsewhere (Eysenck, 10506, 1951). 22 

Having laid his foundation in terms of body build, it is well 
known that Kretschmer proceeds to point out that there is close 
affinity between the manic-depressive type of insanity and pyknic 

‘body build on the one hand, and between schizophrenic disorders 
of all kinds and the leptosomatic (and to a smaller extent the 

athletic) type on the other. He also points out the particular 

_ affinity obtaining between athletic body build and epilepsy, basing 
himself, in part, on Dubitscher’s work with the Rorschach Test 
(1932), who found among athletics reactions very similar to those 
found by Rorschach among epileptics. Westphal (1931) gives а 


table embodying over 8000 cases showing these relations fairly 
clearly (Table XXXIX). ' , 


- TABLE XXXIX 


E IRAN — p — m À—— 


Body Build | Schizophrenics: | Manic Depressives: Epileptics: 
? 5233 cases | 1361 cases 1505 cases 
a —= À a 
% | % % 
Pyknic 19* 64:6 5:5 
Athletic s109 | © ER 
Leptosomatic 50:3 19:2 25:1 
Dysplastic , 1075 | I1 29:5 
. Doubtful 8-6 | 847 | 11:0 
Лр о... fees | 
Тһе writer has reviewed the literature with respect to these 
somatopsychic relations elsewhere (Eysenck, 1947) and will not 


repeat his conclusions; by and large, we may accept the main 
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body build among schizophrenics and of py! 
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. problem of constitution in its full importance. There is no jump 
in thus going over into normal psychology, but as we follow the 
threads between body build and psychological peculiarity from 
the psychotic, step by step, through all types of psychopathic 
personality and get further and further away from those great 
mental disturbances which form the beginning of our investiga- 
tion—lo and behold—suddenly, we find ourselves among healthy 
people, among well-known faces. Here we recognize as familiar, 
normal features those traits which previously we had seen in cari- 
cature. We find the same types of facial structure, the same stigmata | 
of bodily constitution, and we find that behind the same exterior , 
dwell the same psychological forces.’ 

This general theory may, perhaps, be introduced by reference 
to Figure 26. This figure shows the distribution ofthe whole popula- 
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tion in terms of a normal curve of distribution, ranging from one 
extreme (schizophrenia) to the other (manic-depressive insanity). 
All persons left of the mean would be schizothymic, meaning by 
that merely that their personality make-up has in common certain 
elements which are grotesquely exaggerated in those psychotic 
patients whom we label schizophrenics; whereas all those to the 
right of the mean would be cyclothymics, meaning by that that 
their personality make up has in common certain elements which 
are grotesquely exaggerated in manic-depressive patients. Persons 
who are definitely abnormal but not yet psychotic Kretschmer 
calls schizoid or cycloid respectively, whereas the large number of 
persons in the centre of the distribution he calls syntonic, if they 
are on the cyclothymic side, and dystonic if they are on the schizo- 
thymic side. It is possible that Kretschmer would object to the use 
of a normal curve to depict the relation between schizothymes and 
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cyclothymes but little importance can be attributed in any case to 
the form of distribution when the underlying metric is unknown; 
just as in the case of the distribution of intelligence, the normal 
curve must be regarded merely as a convenient device rather than. 
an accurate representation of actuality. 

We believe that in all essentials, Figure 26 brings out accurately 
the main points of Kretschmer's views. We see, therefore, that 
Kretschmer is suggesting a definite dimension of personality, which 
we may call cyclothymia-schizothymia, but it would seem to follow 
from his writings that another dimension is also implied, ranging 

‚ from normality to psychotic disorder and orthogonal to the first, 
so that his theory can best be represented in terms of two ortho- 
gonal axes, one measuring schizothymia-cyclothymia, the other 
normality-psychotic abnormality or ‘psychoticism’. Indeed, if we 
were to follow him faithfully, we would have to add two further 
dimensions, namely, the diatlietic and the psychasthetic scales. 
In his view, cyclothymes vary among themselves on a scale ranging 
from hutnorous, vivacious, quick-witted, to the quiet, calm, serious 
—the so-called diathetic scale; whereas schizothymes vary from 
shy, nervous, sensitive, to dull, stupid, torpid—the so-called psy- - 
chasthetic scale. As, however, there is no experimental evidence 1n 
Kretschmer's work regarding these scales, and as he makes little 
use of them and does not define their relation to each other in any 
way, we have thought it better to simplify the problem by con- 
centrating on his major hypothesis rather than оп these subsidiary 

” z The student of this problem, however, should keep them in 
mind. 

Attempts have been made by Kretschmer (1948), Van Der 
Horst (1916, 1924), Kibler (1925), and Zerbe (1929) to make direct 
2 "tests of the association between mental type and physical 

аы ? T a physical type; 

^. assessing the former by means of interviews ог questionnaires, an 
statistical tests applied to these figures by the Writer have shown 
that the results are in accordance with the hypothesi | 9 ч high 
level of confidence (Eysenck, 1950c). However, Eni. is 
rightly, lays most stress on a somewhat indirect Droesdure i ub 
:18 derived from Van der Horst. “There always appeared a marked 
bject and the schizo- 
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the psychological structure of the pyknic and the manic-depres- 
sive.’ Put in other words, Van de Horst and Kretschmer try to 
show the general validity of the concept of schizothymia and cyclo- 
thymia beyond the psychotic realm by using body build as a 
tertium quid; their method of proof is to show that normal people of 
leptosomatic or pyknic body build react to certain psychological 


эк 


experiencesin a manner similar to that of schizophrenes and manic- 


depressives, who are known to be also leptosomatic or pyknic on 
the average. This is an ingenious method which deserved careful 
consideration. Some of the results from it, particularly those of | 
Van der Horst himself (1916, 1924) and of Kibler (1925) are very . 
impressive indeed. Fundamentally, however, it has certain weak- 
nesses which make it doubtful whether any definite conclusions 
can be derived in this way. As a methodology it follows only partly 
the rules of the hypothetico-deductive method, as the results 
deduced from the hypothesis are net stated in a rigorous enough 
fashion to make proof or disproof possible. This is particularly so 
in view of the fact that the authors quoted do not state their results 
in terms which are amenable to proper statistical treatment. How- 
ever, in spite of these criticisms, it should be realized that here we 
have a method which could be made into an extremely powerful 
tool by slight changes in methodology and procedure, and when it 
is realized that the work to be described was carried out at a time 
when the research genius of the rest of the world was still gazing 
upon the Bernreuter Inventory as a лоп flus ultra of personality 
tests, it will be clear that here we are dealing with a serious effort. 4. 
to come to grips with a problem of fundamental importance at a 
time when its very existence was realized by very few other workers. 
We shall return to these methodological considerations later on 
and attempt to suggest an improved method for solving the very - 
difficult problem which Kretschmer set himself, by means of a ~ 
modification of the factorial approach. Before doing that, however, — . 
we shall briefly summarize various researches falling under this : 
general heading. As has been mentioned before, Kretschmer's con- 
ception oftype implies the discovery of certain correlations between . 
traits. Thus, the schizothyme in his view would be found to be ~ С 
dissociative, form-reactive, high in personal tempo, better in motor, 
co-ordination, more perseverative, and with greater autonomic 
reactivity. It is the intercorrelation of these (as well as many other) : 
traits which define the schizothyme pole of the type continuum as. 
opposed to the cyclothyme pole, which would:be characterized by 


x 
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traits showing him to be integrative, colour reactive, low in personal 
tempo, poorer in motor co-ordination, less perseverative, and with 
less autonomic reactivity. The experimental evidence for these 
various assertions has been discussed at some length elsewhere 
(Eysenck, 10506, 1951); here we may take one of these traits as an 
example of the treatment which Kretschmer gives to these gona 
cepts. For this purpose, the trait which he calls ‘Spaltungsfaehigkeit 
has been chosen, i.e. a trait characterized at the one pole by an 
ability to dissociate, at the other pole by a tendency to integrate 
diverse mental content. As is the case with most of Kretschmer 5 
work, the hypothesis is derived originally from a close clinical 
study of large numbers of subjects. It is then given an operational 
definition by the elaboration of a number of objective tests which 
are conceived to measure the particular trait hypothesized. These 
measures are then applied to groups of schizophrenes and manic- 
depressives and normal leptosomatics and pyknics, and the hypo- 
thesis is considered to be verified if Observed differences lie in the 
expected direction, at a reasonable level of statistical confidence. * 

Kretschmer holds that the concept of dissociation (Spaltung) is 
of fundamental importance in understanding the mentality of the 
schizothyme, just as its opposite, integration, is important for the 
understanding of the cyclothyme mentality. His concept of dis- 
sociation goes further than that of Warren (1934): ‘The breaking 
up of a combination of any sort into its constituents. He means by 


form separate and 


single act of consciousness: from this results the ability to dissect 


into its constituent parts.’ This tendency towards ` 


geration, the manic-depressive. Ё 
A brief description of som 

Kretschmer and his students 

the discussion more realistic. 


( 


€ of the experiments performed by 
to quantify this concept will make 


paper already referred to 
that for each of the experiments 
adequate at the -o1 level. 


(Eysenck, 19506, 1951) 
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measure the disturbing effect of various agents such as noise, a 
flashing light, etc. These distracting stimuli, according to the hypo- 
thesis, should lengthen the reaction-time of manic-depressives and 
of normal pyknics much more than that of schizophrenics or of 
normal leptosomatics, as the schizothyme group should be able to 
keep the two mental contents (experimental stimulus and disturb- 
ance) separate to a. much greater extent than the cyclothymes, for 
whom one should interfere much more with the other. 

(2) In another experiment, the subject has to remember the 
numbers of differently coloured squares on a card which is being 
presented to him, one row at a time, the theory underlying the 
experiment being that the schizothyme, with his dissociative ability, | 
would easily be able to carry in his mind the number.of different 
categories into which to classify these various coloured squares, 
and that he would be quicker and more accurate in the total task. 

(3) In this experiment, coloured groups of nonsense syllables 
are shown to the subject in a tachistoscope with instructions to 
observe either the colours or the letters. He is later questioned about 
both letters and colours. The hypothesis underlying the experiment 
would require the schizothyme, with his higher abstractive ability, 
to be able to observe what is required and to pay no attention 
to other features of the stimulus, whereas the cyclothyme would 
remember more of what he was not asked to observe and less of 
what he was asked to observe. ? 

(4) Long, unfemiliar words are presented tachistoscopically, 
being shown 10 times in succession so as to facilitate reading of the 
word, which could not be completed at one exposure. 'There are 
two ways of getting at the meaning of the word: (1) the abstractive, 
analytic, dissociative method, in which the total word is built up 
by reading successive letters and syllables and constructed as a 
whole from these parts, and (ii) the global, synthetic, integrative 
method in which a single impression is obtained and then elabor- 
ated in successive exposures. The first of these methods would 
be expected to characterize the schizothyme, the second the 
cyclothyme. 

(5) In this experiment, an attempt is made to study the effect 
of mental addition ‘of twenty numbers on the regular rhythm of 
ergograph work. It is hypothesized that schizothymes would be 
capable more easily than cyclothymes to keep these two different 
tasks separate, so that they would give more correct answers in the 
one and have less disruption of the regular rhythm in the other task. 

8.8Р. Р 
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-^ (6) On the Rorschach test we would expect schizothymes ae 
ч cyclothymes to show marked differences when a nS d af 
. made between the number of whole (W) versus the num 5 1 
© © detailed (D, d) answers, an expectation borne out in actual в. 
HEURE These are only six of a much larger number of experiments | 
. carried out by Kretschmer, but they do illustrate the ер р 
. hypothetical trait of dissociative ability, „апа as each of дена 
experiments has been shown to differentiate significantly ES ^ 
"least one case between extreme groups on the schizothymia-cyc o- 
ge thymia continuum, it must be admitted that there is, at least, some 
RA evidence here which favours the Kretschmerian hypothesis. In an 
tc unpublished factorial analysis involving the tests mentioned, as - 
Well as an intelligence test and a measure of body type (the Rees- 
‘ — Eysenck index of body-build described in Dimensions of Personalit ) ! 
- the writer was able to show that a general factor was present in a 
these tests and in the measure of body type, even when ууссан 
: had been partialled out. This experiment was only done on a smal 
^ number (N — 44) of unselected university students at the Univér- 
“sity of Pennsylvania and until results from a replication, at present — - 
-in progress with much larger numbers, are available, it cannot be 
.'- consi ered anything but suggestive. However, it can be seen that 
| -as far as experimental results 80, they do not contradict but tend 
to support the Kretschmerian hypothesis, 
What has been said 9f this particular trait appears to be true, 
mutatis mutandis, of the other 
able evidence has been di 
+ 1950г, 1951); in the absence of fa 
_ traits, we cannot admit that th 
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criminates bétween cyclothymes and schizothymes and that. 
another test of perseveration also discriminates in the same way; | 
this he seems to regard as sufficient proof that both tests measure 
perseveration, which is, of course, a logical fallacy. The writer has. 
shown that a test of suggestibility discriminates between neurotic 5 
and normals and that a test of persistence also discriminates in 
this fashion, and that accordingly the two tests correlate (Eysenck, 
1947). It does not follow from this that persistence is the same thing 
as suggestibility, and, indeed, the proof that the tests in question 4 
are tests of persistence or suggestibility has to be given in terms of 2 
separate factorial studies embodying different types of suggesti- - 4, 
bility and persistence tests. Rui te We 
While Kretschmer places much importance on these traits, ам « 
we may, perhaps, call them in conformity with modern psycho- . 
logical usage, the mainstay of his whole system, of course, is the 
cyclothymic-schizothymic dichotomy. We must criticize his 
method of proving that this dichotomy, whose existence in the — + 
psychotic field few would deny, can and should be accepted in the —.- 
normal field. A more direct method than the one used by him, > 
which relies on the possibility irrelevant tertium quid (the pyknic- e 
leptosomatic dichotomy) is required for this purpose, and isto be ~ - 
found in the method of criterioanalysis. Although Kretschmer 
shows that there is a correlation between body-build and the two 
main types of functional psychotic disorder, that correlation is not . 
very high, and the writer has shown elsewhere (Eysenck, 1947) 4 
that body-build is correlated with other variables ( neuroticism, %.2 
extraversion-introversion) which are unrelated to the Kretsch--.— 
merian concepts. If that be so, then clearly many of Kretsch- 
mer's findings, which appear superficially to support his views, ~ 
may in reality have quite a different explanation. To give just one 
example, Kretschmer has shown that leptosomatics are slow and. 
accurate while pyknics are quick and inaccurate. This may be 
interpreted in terms of his system; however, an alternative ex- 
planation is also possible as we have shown that hysterics tend 
to be more of a eurymorphic (pyknic) body-build than are 
dysthymics, who tend to be leptomorphic in body-build. It has also 
been shown that hysterics tend to be quick and inaccurate, 
whereas dysthymics tend to be slow and accurate. It follows from 
this that there should be a correlation between body-build and a 
speed-accuracy test, but the interpretation of the experiment may 
be in terms quite different from those advanced by Kretschmer, 
| E : 
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using concepts essentially alien to his system. It is because of the 
danger of indirect proof exemplified in this case that a more 
direct method of examining Kretschmer's hypothesis becomes 
necessary. Let us turn then to a more direct test of Kretschmer's 
hypothesis, using the method of criterion analysis. 

In order to use the hypothetico-deductive method in this field, 
it is particularly important to state the hypotheses to be tested 
quite clearly, and to make deductions from them which can be 
tested empirically. As explained before, there are two main 
hypotheses involved in Kretschmer's System: (1) The functional 
psychoses (schizophrenia and manic-depressive insanity) are not 
` qualitatively different from normal mental states, but form one 

extreme of a continuum which goes all the way from the perfectly 
normal rational to the completely insane, psychotic individual. 
All possible intermediate Stages are represented on this con- 
tinuum. (2) The two main functional psychoses ( 
and manic-depressive insanity 
observable in non- 
extreme degree, and 
the extreme schizot 


system, but does not appear essential to it 


he third hypothesis, being 
| » Will not be tested in the 
present experiment. 


We have attempted to state Kretschmer's hypotheses in such 
à way that a statistical and experimental test of them becomes 
possible; while we believe that in stating them in this fashion we 
any way, and while we believe 
» explicitly or implicitly, it should 
: educing a complex and difficult 
System to two brief fundamentals 
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conjunction with the problem posed by Kretschmer’s hypothesis. 
Let the line AB in Figure 27 represent the normal-psychotic 
continuum, and let the line І cut off at point X that part of the 
continuum containing mental states conventionally diagnosed 
‘psychotic’ by psychiatrists. (The distribution of the total popula- 
tion has been tentatively included in the figure in the form of a 
normal curve of distribution. As the actual form of the distribution 
18 irrelevant to the argument, which merely hypothesizes a con- 
tinuous distribution, any other form of rectilinear or curvilinear 
distribution might be substituted for the normal form without 
affecting the argument.) Let n objective psychological tests (а, ih 
с,... 1) be given to the two populations separated by the line 
LT’, i.e. to a normal group and to a psychotic group. (The term 


A M p 
NORMAL PSYCHOTIC 


Figure 27 


‘normal’ here means nothing but ‘not under psychiatric care for 
mental disorders’; it does not imply anything more positive than 
this absence of demonstrable and demonstrated mental disorder.) 
Let us assume that each of these n tests distinguishes significantly 
between the normal and the psychotic groups. (In actual practice, 
n + x tests would have to be given in all, so that tests not dis- 
tinguishing at the chosen level of significance could be rejected.) 

Now let us divide the normal group into two parts, by making 
а cut at point Т, on the line AX; similarly, let us make а cut at 
point M on the line XB, thus subdividing the psychotic group also. 
(Points L and M may be anywhere between A and X, and x and 
В, respectively; there is no implication that they should divide 
the respective populations into equal halves.) On the hypothesis 
that AB represents a true continuum, we have nowedivided both 
the normal and the psychotic groups into two parts, one of them 
more normal, the other more psychotic. Group AL is more normal 
than LX; group XM is more normal than group MB. If AB is a 
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true continuum (the hypothesis to be tested), and if each of our 


‚ п tests is related to that continuum in a linear fashion (supple- 


` 


mentary hypothesis), then it would follow that on these n tests 
group AL would differ from group LX in the same way that 
group AX differed originally from group XB. Similarly, group 
XM would differ from group MB in the same way that AL differed 
from LX. 

We may put these arguments in a,form which permits of their 
being tested. If tests a and b differentiate significantly between 
groups AX and XB, then it would follow according to our hypo- 
thesis that they should differentiate also between AL and LX. 
Similarly, they should differentiate between XM and MB. This 
deduction, unfortunately, does not permit of any direct test, as 
there is no known method of determining points L and M, and 
therefore of differentiating the groups under discussion. However, 
we can transform the concept of ‘differentiate’ into the concept of 
‘correlate’, and test our hypothesis in this fashion. Ifon tests a and 
b the normal group does better than the psychotic group, it would 
follow, as explained above, that group AL should do better on 
both tests than group LX. Similarly, group XM should do better 
on both tests than group MB. But these statements are synonymous 
with saying that for both the normal and the psychotic groups separately, 
there should be a positive corilation between tests a and b. 

This argument can be extended 
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our п tests. Let us assume that we extract two factors from each 
of our two matrices, the sets of intercorrelations for the normal and 
the psychotic groups respectively, which we may сай Е, and Е, 
for the psychotic and Е, and Е, for the normal group. It would 
follow from our hypothesis and our selection of tests that F, and F, 
should be proportional to each other, and to our Criterion Column, 
Съ. Here, then, we have reached the final and crucial test. We 
have made a hypothesis as to the existence of a general factor 
underlying the pattern of variances and covariances of test per- 
formances of normal and psychotic groups, and deduced certain 
consequences which would follow if our hypothesis were true, and 
which would not follow on any tenable counter-hypothesis. In so 
far as these deductions are verified, we may consider our hypo- 
thesis as supported; in so far as these deductions are not verified, 
we may consider our hypothesis as disproven. У 

It will be clear that a similar procedure could be used with 
respect to our second hypothesis. What would be required there 
would be a set of n tests which discriminate between schizophrenic 
and manic-depressive patients; the Criterion Column in this case 
would consist of the biserial correlations of each test with this 
new criterion. Apart from these changes, the procedure as out- 
lined above is applicable to just the same extent as it is in respect 
to our first hypothesis. The continuum in question is shown Ш 
diagrammatic form in Figure 26; on the basis of Kretschmer's 
hypothesis we would expect factors Ё, and F,’ to be proportional 
to cach other, and also to the new Criterion Column, С.а 

Во hypotheses can be tested.at the same time, using 7 tests 
Which differentiate significantly (using Fisher's F test) between 
the three groups involved, the normal, schizophrenic, and manic- 
depressive. The two criterion columns, С„ and Ce could then 
be calculated quite easily by running biserial correlations between 
cach of the п tests and the normal versus psychotic dichotomy for 
Cus (using the combined schizophrenic and manic-depressive 
8roups to form the psychotic group), and by running biserial 
correlations between each of the z tests and the schizophrenic- 
manic-depressive dichotomy for Csa Product-moment correla- 
tion would then be run between the 7 tests for the normal and the 
psychotic groups separately, thus giving us two separate matrices. 
These, when factor analysed according to either the centroid or 
the summation method, or indeed any of the current methods, 
should result in two factors each, F and F', which would 
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be-proportional to each other, and to the respective criterion 
columns. . 

This, in brief, is the method which has been used in the present 
experiment. The experimental population consisted of тоо normal 
subjects and roo psychotic subjects divided into two equal groups 
consisting of 50 manic-depressives and 50 schizophrenics.1 Al- 
together, 84 sets of scores were derived from over 30 independent 
objective tests given to these three groups, of which the great 
majority were significant when the F test was applied to the scores 
of these three groups. The tests used included four tests of fluency, 
the Crown Word Connection List, a work-curve test, a colour- 

* form test, a mirror-drawing test, a social-attitude test, two level 
of aspiration tests, a concentration test, 11 tests of expressive 
movements, two tests of perseveration, one test of dissociation 
(Spaltungsfaehigkeit), one test of tapping, four tests of oscillation, 
one speed-accuracy test, one suggestibility test, one tracing test, 
and one persistence test. Results of these various tests are given in 
detail elsewhere (Eysenck, 1951). The results may be summarized 
verbally, drawing attention to the following points. In the first 
place, the large number of significant differences found shows that 
the three groups tested are very unlikely to have come from a 
single universe; there is good reason to believe that the selection 
procedures have been successful in giving us subjects showing 


much greater heterogeneity than would have been expected by 
chance. À 


In the second place, it appears that on the majority of tests, the 
schizophrenic group sco 


manic-depressive grou 


tendency, although less 


manic-depressive group to be larger than in the schizophrenic 
group. 


1 Further details re 


garding population and other 
mental group will be 


features of the experi- 
found elsewhere (Eysenck, 1951). 
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In the fourth place, it is interesting to note that those tests 
which had in past work been shown to have high correlations with 
neuroticism, or to discriminate well between normals and 
neurotics, do not show even a tendency to discriminate between 
normals and psychotics. We might mention here such tests as 
persistence, body sway (with and without suggestibility), per- 
severation, work curve inversions, and the word connection list. 

In the fifth place, it is clear that neither the colour-form test 
nor the dissociation test gives results which support Kretschmer's 
hypotheses. There are no significant differences with respect to 
these tests, and the actual figures are not even suggestive of any 
such differences as he predicts. It is possible that the tests used 
by us were unsuitable for a relatively high-grade group of subjects, 
and that with tests better suited to the level more significant 
results might have been obtained. While this argument cannot be 
gainsaid, there is no evidence to favour such speculation in our 
results. 

In the sixth place, it is apparent that there are a number of 
Personality traits which distinguish the normal from the psychotic 
group at or below the 1 per cent level. We may perhaps briefly 
summarize these, emphasizing of course that the terms used here 
should be understood to bear the operational connotation imposed 
upon them by the actual tests used. We find then that psychotics 
are less fluent, perform poorly in continuous addition, perform 
poorly in mirror drawing, show slower oscillation on the reversal 
of Perspective test, are slower in tracing with a stylus, are more 
undecided with respect to social attitudes, show poorer concen- 
tration, have a poorer memory, tend to make larger movements 
and to overestimate distances and scores, tend to read more 
slowly, to tap more slowly and to show levels of aspiration much 
less reality adapted (Eysenck, 1050, 1951). Those who would 
argue that of course psychotics are poorer in all tasks than are 
normals, and that consequently these results are hardly surprising, 
Will have to explain why no such differences were observed with 
respect to the tests enumerated under point four above. 

In the seventh place, no differences were observed with respect 
to a number of tests other than those mentioned under number 
four. These are: social attitudes, such as radicalism, time of writ- 
ing, number of lines drawn. Occasionally, tests undér the same 
general classification gave conflicting results; thus, under the head- 
ing of ‘Fluency’, tests ‘Flowers’ and ‘Words’ showed no significant 
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differences, while tests ‘Birds’ and ‘Animals’ did. Similarly, when 
both the best and the worst of a series of scores on a test were 
scored separately, different results were sometimes obtained. As 
an example, we may select test 7, ‘Concentration’, where two sub- 
tests are used, ‘Letters’ and ‘Numbers’, each consisting of eight 
separate scores which are added to give total scores. For both 
‘Letters’ and ‘Numbers’, the lowest scores do not give significant 
differences, while the highest scores give significant differences. 
This phenomenon could with advantage be studied separately; its 
implications are, presumably, that while normals often do as 
poorly as psychotics on a single trial, psychotics hardly ever do 
as well as normals, even on a single trial. 

This completes our brief summary of the isolated results of the 
experiment, and we must turn now to the study of the patterns of 
intercorrelations by means of criterion analysis. " 

In view of the fact tha5 not all the tests used gave highly 
significant results, and as many of the scores used were not experi- 
mentally independent of each other, twenty tests in all were 
selected for the factorial study. These tests are given in Table XL. 
Also given in this table are the biserial correlations of cach test 


TABLE XL 
5 I II ш IV. 
" Ted Tes Tnd Тер 
1. Overestimation in distance judgment —'160 | —-325 159 | —:233 
2. Fluency—animals "323 203 189 293 
3. Social attitudes—zero responses —'189 | —-255 | 070 —:222 
4. Reading prose—time in secs, — 364 | —-332 | —-og1 | —:335 
5. Three circles—time in secs, -01822 | —-254 | —-140 | —:268 
6. Concentration—numbers 1274 "279 | —:037 278 
7. Tapping—15 secs. :97| 280| —-081 44 
8. Mirror drawing—] scores, average 364 237 228 p 
9. Perspective reversal —slow 290 -089 208 —*199 
10. Perspective reversal —normal 193 294 ~-016 7229 
11. Abstractions—letters remembered '379 195 20 “01 
12: Threc squares—diameter —*397 — 1 =g 
13. Tracing test—J scores, average —'274 |: —-265 016 —:257 
214. Work curve—lowest score 376 262 12 "309 
15. Size estimation—half-crown —:253 | —-332 чой —204 
16. Numbers 1—20; length of writing —250| —-063 | —-216 | —-162 
17. Expressive movements—length of waves 7459 | —:213| —:279 | —:341 
18. Expressive movements—amplitude —'556 | —-242 | —-339 | —-402 
19. YEAR—length of writing —"353 | —-228 | —-097 | —:275 
20. Suggestibility —7005| —:042| 0309 | —025 
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with the following dichotomies: (1) Controls versus depressives; 
(2) controls versus schizophrenics; (3) schizophrenics versus 
depressives (С, а); (4) controls versus psychotics (C,,). The 
product-moment intercorrelations for these twenty tests were 
calculated for the normal (control) group, and separately for the 
psychotic group. The tables are given in Eysenck (1951). 

These two tables were factor analysed by means of Thur- 
stone’s centroid method. Thurstone’s method of sign reversal was 
used until all the column sums were positive; this was followed by 
reflection of pairs of columns in accordance with Holley’s criterion 
(1947). Two factors were extracted from each of the tables; this 
number was decided by means of the following method. (1) A one- 
factor solution was assumed and iterations made until the com- 
munalities from the last two iterations were all within + 005; 
when the first factor residuals were calculated. 28 and 22 of these 
were above the -05 level of significarice in the two tables, where 
19 would have been expected by chance; 12 and 8 respectively 
were above the -o1 level, where 4 would have been expected. It 
was therefore decided to extract a second factor. (2) A two-factor 
solution was assumed and iterations carried out until the last two 
agreed within -+ соот. The second-factor residuals were then cal- 
culated, and in both tables 8 and 2 respectively were found to be 
significant at the -o5 and сот levels, wheve 19 and 4 would have 
been expected by chance. The analysis was terminated at this 
point. Factor saturations are given in Table XLI together with the 
communalities. 

We now have available the data with which to test our hypo- 
thesis. As shown in section one, Kretschmer's hypothesis requires 
that the two sets of factor saturations should be proportional to 
cach other. We must therefore correlate Ё, with Fp, and F,’ with 
F, 3 the hypothesis requires that these two correlations should be 
significantly positive. In actual fact they are + -868 and + :746; 
ч appears therefore that this deduction is borne out by the facts. 

oth factor patterns are remarkably alike. 

The next deduction to be tested relates to our Hypothesis I, 
and states that F, and F, should both be proportional to С». The 
respective correlations are + 895 and + '954- Again we find our 
deduction verified, and it would appear that psychotic;states doin 
fact form a continuum with normal mental states. Gur last deduc- 
tion relates to Hypothesis II, and requires that F,’ and F,’ should 
both be proportional to С, The correlations in question are 
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TABLE XLI 
Normal Group Psychotic Group 
Е, ЕГ | hy? | Ey | Ej hy? 
| Кс = = 
. | "1184 | -036 | 198 1 *044. ‘031 

E sae “342 | 158 | —-379 361-274 
3 119 048 ‘016 | —.o60 -183 :037 
4 `498 —'198 287 | "963 —+165 159 
5 888 | :052 "149 | 396 | 196 à E 
6| —334 | 05 :боо —95 | 452 | 527 
7 — 176 157 :056 —:99 | 226 | ыг 
8 355 | 00558 | 189 | 276 0888 | ч 5 
9 — +263 —'215 |  *r15 | 7378: | “208 19 
10 —:152 —'129 | одо —:973 —-064 2 
11 —'241 `391 | "211 —:448 “170 d 
12 623 | 417 “562 | 174 5171 б 6 
13 329 | —:053 | її 425 -0124 За 
4 —428 | 260 “251 —-497 142 207 
15 084 | 288 | одо "244. 121 on 
16 063 | *508 262 183 | :857 | Ч: 
17 одо | 156 -082 509 | “156 -283 
18 | 4315 :054 "102 587 | 120 *303 
19 1107 "69 | одо “202 579 376 
20 | '007 | —обо Р *004 —:034 —:099 ‘oll 

! 


+ 2029 and + :085, and it willbe clear that in this case the 
deduction is not verified. It would appear to follow that schizo- 
thymia-cyclothymia does not exist as a Separate dimension of 
personality, s 

Before accepting this ne 
appear desirable to study th 


(1950), one feature of this m 


columns in the present study, 
are possible for each of the 


1No attempt is made here to interpret Е, and Ej'. Such interpretation 


useful purpose. The possibility 
ersonality dimension extraver- 
n considered, but until the tests 


may be related to the р 
Eysenck, 19506, 1951) has bee; 
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a vector is found which, in the two-factor space, coincides as 

nearly as possible with the projection of the C,,, column on that 

space, and (2) rotation such that a vector is found which, in the 

two-factor space, coincides as nearly as possible with the pro- 

jection of the C, ; column on that space. In either case, the second 
factor is calculated by keeping it orthogonal to the factor first 
extracted. When these rotations are carried out the following 
results are obtained: К 

Correlations between f, and f£, and Ё, and Ё, are now 
+ 858 and + -769 respectively. Correlations of Ё, and Fp with 
С» are now + :894 and + :951 respectively, while correlations 
of Е, and E, with С, are still quite insignificant. Rotations 
according to (2) are rather more interesting. Correlations between 
(F,) and (F,) and (F,') and (F,’) are now + 830 and +793 
respectively. Correlations of (F,) and (Fp) with Csa are now 
+ +548 and +- -679 which seemingly indicates that here we do 
have some justification for speaking of a schizothymia-cyclothymia 

' factor. But this putative factor can be shown to have no real 
meaning when we compute the correlation between C,,, and Ca; 
which turns out to be + 5550. This is simply a confirmation of a 
point made before, viz. that differences between schizophrenics 
and manic-depressives tend to be in the same direction as differ- 
ences between normals and schizophrenics, and that therefore the 
schizophrenic group tends to be intermediate between the others. 
In other words, the correlations of (F,) and (F,) with Csa appear 
due entirely to the correlation of C, with С, „, and again we find 
no evidence whatever for the existence of a schizothymia-cyclo- 
thymia factor, 

This conclusion, like others in this chapter, should not be 
taken as in any way definitive. It is possible that a selection of tests 
which gave more scope to schizophrenic-depressive differences 
might produce results more in line with Kretschmer's hypothesis, 
although the fact that tests of his which were included did not 

succeed in producing this discrimination makes it somewhat un- 

likely that very different results would be reached with a different 
selection of tests, It is perhaps significant that wherever results in 
this experiment are positive, they are very decidedly so, and where 
they are negative, they are equally decidedly negative; the support 
for the first hypothesis investigated, and the failure to find any 

Support for the second hypothesis, are equally impressive in their 

decisiveness, 


2 
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We have shown, so far, that neither neurotics nor psychotics 
are something sui generis, qualitatively different from normal 
people; instead, we have been able to show that there exists a 
‘neuroticism’ continuum linking normals with neurotics, and а 
*psychoticism" continuum, linking normals with psychotics. The 
question immediately arises: Are these two continua identical, or 
are they independent of each other? Identity would presumably 
be assumed by the Freudian theory of ‘regression’, according to 
which there is a continuum ranging from the normal, well-adapted 
personality, through the neurotic, partly regressed patient, to the 
psychotic, almost completely regressed. Independence would be 
assumed by most of the psychiatric textbooks written along 
Kraepelinian lines. The problem is clearly an important one, 
_indeed it is fundamental for any useful taxonomic system. There 15 
по doubt that it is eminently amenable to a systematic ехрегі- 
mental attack. . 
There are certain fairly obvious clinical findings which make 
the ascumption of one common continuum somewhat unlikely. If 
we were dealing with one single continuum, normals developing 
a psychosis should pass through a state of neurotic disorder first; 
similarly, neurotics should be far more susceptible to the develop- 
ment of a psychotic illness than normals. Neither of these deduc- 
tions appears to be verified by psychiatric observations. Somewhat 
more definite are certain findings from military experience. There 
is a clear, monotonic rise in neurotic breakdown rate with increas- 
ing length of military service; there is no such rise in psychotic 


breakdown rate. There is a clear drop in neurotic breakdown after 
the cessation of hostilities, 


more formal proof of this 
€ type of deduction which 
es between which we аге 
nuum’ hypothesis, we are 


Pointing out the facts to me, and for 
hich the above conclusions are based. 
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trying to decide. On the ‘single conti 
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dealing with three groups—normals, neurotics, psychotics—whose 
mean ‘regression’ scores lie in that order along a single continuum 
or dimension. We would, therefore, expect that those tests which 
discriminate significantly between normals and neurotics should 
also discriminate with even greater significance between normals 
and psychotics. We have already seen that this is not so. Tests like 
suggestibility, static ataxia, persistence, and the Word Connection 
List discriminate between normals and neurotics at a high level of 


LENGTH 
ESTIMATE 
51 [E3] 
SCHIZOPHRENIC 
50 
49 MANIC- DEPRESSIVE 
ЇЕ 
А 
HYSTERIC 
48 
© A 
NORMAL IL PSYCHOPATHIC 
S Ф A 
NORMAL 1 ANXIETY 


" 2 13 4 15 16 7 18 
WORD CONNECTION LIST 
i. Figure 28 

Significance, but fail completely to discriminate between normals 
and psychotics. Other tests, like the large selection of expressive 
movement tests described in Eysenck (19506, 1951), discriminate 
at а high level of confidence between normals and psychotics, but 
ail to discriminate between normals and neurotics. These results 
Have already been mentioned in passing. It may be more en- 
ghtening to present comparisons based on a few selected tests in 

diagrammatic form. ^ 2 
Figure 28 shows a plot in which the ordinate and the abscissa 
Tespectively represent scores on two tests (the Word Connection 
ist and a test of Length Estimation, scored for amount of over- 
estimation), the first of which discriminated ‘very significantly 
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between normal and neurotic groups, while the secon 
criminated very significantly between normal Е - i. 
groups. Plotted in the diagram are the mean scores 9 MC. (one 
groups who were given the two tests, two Gea a EM 
manic-depressive, the other schizophrenic), and t Эн suffering 
groups (one hysteric, one psychopathic, and the one оно 
from anxiety). Оп the basis of the ‘single continuum" E lind 
we would expect these means to lie on a more or less straig 


LENGTH 


ESTIMATE 
51 


50 


PSYCHOTIC 


49 н 


48 


® 
47 NORMAL 


WORD CONNECTION LIST 
Figure 29 


е 
with the normal groups at one end, the psychotic groups a t 
other, and the neurotic groups intermediate. It will be seen Ён 
the facts do not bear out the hypothesis, It is impossible to reg 
the results as compatible with a single-dimension hypothesis; M 
need at least two dimensions to accommodate the triangu 
normal-psychotic-neurotic grouping. This is even more appare 
in Figure 29, where the average results аге plotted for the normals: 


з : : Е it 
psychotics, and neurotics respectively, using total groups not spl 
up into their'constituent parts. Fi 


; : gures 30 and 8: bear out tht 
impression; using two other sets of tests (suggestibility and 2? 
expressive movement test described in 


Eysenck ( 1950) in the ono — 
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and tapping rate and static ataxia in the other) we find ү їп 
neither case can the three means be arranged in a linear form, 


AVERAGE SIZE 


ОР v 
по 
Б] 
PSYCHOTIC 
105 
100 
95 A 
[C] NEUROTIC 
NORMAL 


Ц 2 3 4 5 
SUGGESTIBILITY 
Figure 30 
TAPPING 
68 А 
РЅҮСНОТІС 

69 

70 

7! 

72 


NEUROTIC 


го 15 2-0 
STATIC ATAXIA 


Figure 31 А 
Suggestive of one underlying dimension or сопйпицүл; in each case 
а two-dimensional space is required. 


These fairly clear-cut examples do not of course constitute a 
8.8.Р, 7 9, 
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formal proof; no effort has been made to apply formula to these 
results and show the statistical significance of the deviations from 
a. straight line. As a demonstration ad oculos, these diagrams 
appeared sufficiently striking to demonstrate the logic of the argu- 
ment underlying our proof; for the proof itself we must turn toa 
somewhat more formal demonstration. Two such demonstrations 
will in fact be given, the first more definitely experimental, the 
Second more statistical in nature. 

The experimental study alluded to was carried out by F. 
Freeman (1951), using the Character Interpretation test illus- 
trated in photographs 29 and 30. The nature of the test will be 
apparent from these reproductions. Twelve photographs of human 
faces—male and female, adult and child—are given to the subject, 
together with a set of 24 adjectives for each. The various sets к 
adjectives are divided into two lists of 12 each, called List A an' 
List B; instructions are to pu: a check mark beside the four words 
in List A which best describe the person shown in the picture, an 
then tG put a cross beside the four words in List A which least 
describe the person. The same is then done for the adjectives 11 
List B. 

The twelve adjectives in each List are selected according Ox 
scheme which is identical for all the pictures. Two adjectives 
relate to feelings of (a) hostility, two to feelings of (b) self-import- 
ance, two to (с) immorality (other than sexual), two to (9) sexua 
immorality, two to (е) depressive feelings, and two to (f) fearful- 
ness and insecurity. One of the two terms in each case is worde 
positively, the other negatively; ‘vir 
form a contrasting pair of word. 
and ‘undisturbed’ would form a similar pa 


Negative (unfavourable) reactions ( 
beside an unfavourable 


in 
At the fourth level, all categori Е 
; Sores tora. Picture are combined, 
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giving a general index of liking or disliking for that picture. And 
at the highest level, all these indices are combined in a grand total, 
expressing an individual's likingor disliking towardsall the pictures. 
The general hypothesis underlying this test will be apparent 
from the description. Differences between subjects in assigning 
character traits to the people whose pictures constitute the test 
may be presumed to be related to the personalities of the subjects 
themselves; if that be true, it would follow that the scores of men 
would differ from the scores of women, or that the scores of 
neurotics would differ from those of psychotics, and those of both 
groups from the scores of normal subjects. 
These conclusions would only follow, of course, if our system of 
scoring possessed a certain amount of reliability; like any objective 
Scoring system for what is essentially a projective test (properly so 
called in this instance, as the mental mechanism hypothesized is 
that of Projection) the one employed here may be putting together 
9n a prior grounds scores which psychologically are quite un- 
related, Reliabilities were calculated at various levels of cotnbina- 
Чоп, and are sufficiently high to show that such a criticism would 
not be entirely justified. Reliabilities at the third level (List A 
Versus List B for pairs of adjectives) for тоо normal male subjects 
average around "40. At a higher level, we compare total ‘like’ 
Score for a particular category in List A for a given picture with 
total ‘like’ y 
Ow are.given the reliabilities, averaged over all 12 
for roo normal, 50 psychotic, and 50 neurotic subjects, 

all of them male. The values are: -67, -68, and :67. Considering 
the small number of choices which makes ир each score, these 
reliabilities are rather encouraging. Neither age nor intelligence 
were found to be correlated with the scores. 

. € May now return to the main reasom why we introduced 
а discussion of the Character Interpretation test into this chapter. 

our assumption is justified that normal, psychotic and neurotic 
Subjects will react differently to the stimuli provided, and that 
Cir reactions are the product of fundamental personality traits,” 
then it would follow that on the Freudian hypothesis of a single 
normal-neurotic-psychotic dimension the psychotic deviations 
from the normal pattern should be in the same "direction as the 
neurotic deviations, but more strongly marked. If ng such general- 
ization could be derived from the data, then the ‘single con- 
tinuum’ hypothesis would appear to be invalidated. 
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In order to avoid the necessity of quoting the extremely 
lengthy and complex material relating to all the pictures in this 
test, we shall give details only for the two pictures reproduced in 
this book as photographs 29 and 30. The pictures represent а 
male negro and a white woman respectively. In the case of the 
negro, the scores for categories a to f, summed over Lists A and B, 
are as follows: 26:56 for 100 normal male subjects; 23:00 for 50 
male psychotics; and 28-84 for 50 male neurotics. Two of the three 
differences are significant (normals versus psychotics) or very 
significant (psychotics versus neurotics), and it will be noticed that 
neurotics and psychotics differ from normals in opposite directions; 
the psychotics give more ‘like’, the neurotics more ‘dislike’ re- 
sponses than the normals. On the other picture, the scores of the 
same three groups are respectively 30:26, 25:52, and 29:72; the 
normal-psychotic and the psychotic-neurotic differences are both 
very significant. In this case ihe psychotics again have higher ‘like’ 
scores than the other two groups, but the neurotics are not in any 


to the normal-neurotic-psycho- 
he Freudian hypothesis. Other 
al-psychotic, or normal-psychotic- 
‘equently, thus suggesting that We - 
ingle continuum. 


test items. On the first test (a 
liked best) the sequence of scores js normals (12-58) psychotics 
(15:04), апа neurotics (17-80); ай these Bosse аг significan 
ог very significant. Normals, apparently, consider themselves like 

best, neurotics consider themselves liked least, with psychotics i 
between. On the second test (adjectives marked as by the perso? 
liked least) the sequence of scores is normals (21-06) psychotics 
(21-32), and ^eurotics (24°84); neurotics differ significantly fro 

both normals and psychotics. Here again, normals consider the™ 
selves liked best, neurotics consider themselves. liked least, with 
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psychotics in between. These results also fail to Support E. 
‘single continuum’ hypothesis, at a high level of statistical signifi- 
cance. 

We must now turn to a somewhat more statistically oriented 
study. The method used in it has already been alluded to in our 
discussion of the work of Rao and Slater, in which they showed 
that psychiatric ratings of hysterics, psychopaths, obsessionals, 
anxiety states, and normals on thirteen points could be explained 
entirely in terms of one dimension (identifiable with neuroticism); 
without requiring any other continuum than differences in severity 
of neurosis to account for diagnostic differences between the groups: 
The method to be used has been developed independently by 
several workers including C. R. Rao (1948) and is based on 
Fisher’s maximum likelihood principle; it has obvious similar- 
ities with the independently developed solution of Penrose (1947) 
-and Smith (1947), and falls: under the general heading of dis- 
criminant function analysis. Rao & Slater (1949) essentially used 
certain: linear discriminant functions called ‘canonical’ to give 4 
Scometrical representation of the differences between various 
neurotic sub-classifications; their method requires an answer to the 
dimensional problem—how many dimensions are sufficient po 
represent all the groups measured?—and supplies a test of signifi- 
cance which is unfortunately lacking in the traditional factorial 
approach to this question. Lubin (1950) has put Rao's method 
into a form in which it can more easily be understood by workers 


in the psychological field, and has also carried out the analysis on 
which this. section is based. 
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E.C.T., or be subjected to leucotomy. In a similar way, although 
intelligence is generally considered to be a continuous variable, 
the decision to label a person as ‘mentally defective’ is in essence 
based on a system of classification in which everyone is either 
‘normal’ or ‘subnormal. Categorical decisions are essential 
in practice; it is the task of science to aid in the correct 
choice of categories, and to minimize the number of misclassifi- 
cátions. : 

The best available method for classifying individuals into 
mutually exclusive categories on the basis of quantitative scores is 
Fisher's discriminant function (1936, 1938). It is applied in the 
present case to three groups of subjects—50 normals, 50 psychotics, 
and 50 neurotics—of roughly equal age and sex distribution. None 
of the psychotics were certified, and none were of the chronic, 
deteriorated type. The normals were chosen on the usual negative 
criterion of not being to our knowledge under treatment for mental 
disorders of any kind. It is almost certain that the criterion is far 
from perfect, and that if the whole group of 150 subjects were to 
Бе seen and diagnosed by a psychiatrist ignorant of their previous 
diagnosis, а correlation of less than unity would be found. This 
fact will seriously attenuate our results, but is unlikely to lead to 
exaggerated claims as to the value of the method. Four tests of 
manual dexterity were given to all the subjects, the tests chosen 
being tests M, N, O, and P from the General Aptitude Battery of 
the United States Employment Service, whose kindness in allow- 
ing us to use their tests is acknowledged with gratitude. (The tests 
are illustrated in photographs 10, 20, 23.) Again, the choice of 
tests was such that our findings would constitute a minimum 
rather than a maximum claim; it is well known that the higher 
the intercorrelations between tests, the less likely are they to make 
independent contribution to a multiple selection problem. It will 
be clear that if normals, psychotics, and neurotics сап be dis- 
Criminated successfully on the basis of four manual dexterity tests, 
then much better discrimination could be achieved on the basis 
of a larger number of more diversified tests. However, we are here 
concerned more with the method than with the results, and even 
with the restrictions imposed on the solution by the choice of tests 
and subjects, results will be shown to be markedly positive and in 
accord with our hypothesis. МИЙ 
, The first step in the procedure requires the condensation of the 
Tour tests into the two canonical variates that will best differentiate 
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the three groups in two dimensions; the second step requires the 
application of Rao's likelihood solution for classification. Table 
XLII shows the results of the first step; in it are given the means, 
variances, and correlation ratios of the two canonical variates, Y: 
and Ү,. It will be seen that Y, has a correlation of -64 with the 
trichotomous classification, while Y, has a correlation of -45 with 
it. The first of these is the largest possible correlation ratio of any 
linear combination of the four tests with the criterion, while the 
second is the largest possible correlation ration of any linear com- 
bination of the four tests which at the same time correlates zero 
with Ү,. In combination, Y, and Y, correlate with the criterion 
to the extent of -78. 


TABLE XLII 
те = Е 
| ? | 5 а. | RR. la lation 
| Normals Neurotics | Psychotics | Ra Ho. 
Yi | 79:240 | 66:540 | 59:420 | “64 
Y. 40:660 48:560 | 39:280 | 45 
5,2 79166 | 55402 136-167 | 
Sa? | 56596 | 97639 | 64247 | 


Before calculating Y, and Ү,, we must of course test the latent 
roots for significance. It is always possible to represent the group 
means in m-1 dimensions; before using this number of dimensions, 
however, (two in this case) we must show that the means do not 
lie along one dimension, or coincide at one point. In other words; 
we must show (1) that there are significant differences between 
the groups, and (2) what the minimum number of dimensions is 
which is required to represent the differences between the group 
means. It is the possibility of such a test of significance that consti- 
tutes the main claim of this procedure to our interest. According (0 
Bartlett’s (1938) test, both roots are significant at the P = ‘00! 
level. We may therefore conclude that two dimensions are neces 
sary and sufficient to account for our observed test data. Inter- 
pretation of Y; and Y, will be attempted later on; here we may 


1 The canonical transformation of the original variables is not a necessarY 
step (Slater, 1948); the likelihood that an individual belongs to any one of the 
groups can be calculated by using likelihood functions based on the original test 
results directly, Tiis latter method cannot be less efficient than the former, ай 
it may occasionaliy be significantly more efficient. However, it is very mu 


more laborious. Both solutions were calculated, and the difference betwee? 
them found to be insigrificant, ! 
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point out simply that inspection of the values in Table XLII sug- 
gest that Y, is identical with our ‘psychoticism’ factor, and Y, with 
our ‘neuroticism’ factor. : 
_ If we wish to go on and assign our 150 subjects to their appro- 
priate class on the basis of their test scores, we must transform the 
canonical variates, Y, and Y, into likelihood functions of the Rao 
type. This procedure, which will minimize the proportion of mis- 
classifications, consists of calculating the likelihood for each 
individual of belonging to each of the three groups, and then 
assigning him to that group for which he has the maximum likeli- 
hood. Table XLIII gives the results: it will be seen that 71 per . 


T TABLE XLIII 
а Se See 
| Psychotic Neurotic | Normal Total 
aa | 
i | 
коса Psychotic | a a 9 3 46 
i Neurotic | “9 | 32 6 47 
embership | Normal 7 | 9 4t 57 
——— - E к Pc 
| 150 
== | = | |= ENSE : ms: 
Percentage correctly classified: | 689, | 64% | 82% 713% 


E eor | 


cent of all cases are correctly classified.! A non-parametric test of 
significance (Lubin, 1950) applied to this table shows that mis- 
classifications are very significantly fewer than chance (P < соот). 
‚ As was pointed out earlier, the quadratic likelihood solution 
using the canonical variate scores is not the only one possible. We 
шау use a linear rather than a quadratic solution, and we may 
use the original test scores rather than the canonical variates based 
on them. These various methods give rather similar results. Results 
of the various methods are compared in Table XLIV. 
: A visual presentation of the data is given in Figure 32. The 
Wo canonical variates, Y, and Ү,, constitute the ordinate and the 
abscissa: the mean values of the 1 50 subjects are plotted according 
5 their scores on Y, and Y, The three lines which split the 
diagram into three parts are the discriminant function lines, giving 


* It will be seen that the total number classified into са zh group does not 

аа with the number known to be in that group. This is an obvious 

ifficulty of this method which has received а proper solution only recently 
(Rao, 1948). з а 


о 
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TABLE XLIV 


Percentage Correctly Classified 


ЕД Variables Used - | Е — 
й ` Total | Normal | Neurotic | Psychotic 

| о/ °/ 27 

о 40 19 

Quadratic Canonical variate 71:3 | 82 64 68 
Linear Canonical variate 68-7 78 58 70 
Quadratic Test scores 72-7 80 66 72 
Linear Test scores - 68-7 80 68 68 


1 


T Predicted group membership on the basis of the quadratic likelihood 
canonical variate scores. z 


58 
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70 


76 


NORMAL 
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GROUP MEAN 


35 4 


47 55 
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the lowest degree of misclassification. The resemblance of the 
positions of the three groups to those of the three groups in Figures 
29, 30, and зт will be noted. It will also be noted that the position 
of the psychotic and normal means, which are parallel to the Y; 
axis, justifies our interpretation of this canonical variate as 
psychoticism’, while the fact that Y, discriminates the neurotic 
mean from the two other means may serve as justification of our 
Interpreting this canonical variate as one of ‘neuroticism’. 

1 Not too much importance is attributed to the actual discrimin- 
ation achieved. It is well known that statistical combinations of 
several scores in order to give maximum correlation with a 
criterion tend to give spuriously high values because of the utiliz- 
ation of chance errors which happen to correlate with the predicted 
variables. The fact that the zero-order correlation ratios are rela- 
tively high—being equal to 550, 57, 756, and 551 respectively for 
tests M, N, O, and P—makes it unlikely that the multiple R would 
drop very much on repetition of the experiment (two such repeti- 
tions are under weigh at the moment). Also it is clear that a more 
Careful selection of subjects, and a more varied selection of tests, 
Would result in substantial improvement on the figures given here. 

owever, our main purpose in quoting this study lay not in the 
demonstration of the possibilities for differential diagnosis of objec- 
Чуе tests (welcome as this proof may be); it lay in the demonstra- 
дой that two dimensions are required in.order to account for the 
distribution of scores made by normal, psychotic, and neurotic 
subjects. This conclusion agrees with that derived from our 
factorial studies, and: adds the important element of statistical 
Significance; where previously our results had to be evaluated 
More or less on an impressionistic basis, they can now be stated in 
terms of exact P values. 


Em. 


Chapter Seven 


APPLICATIONS OF DIMENSIONAL ANALYSIS 


an attempt has been made to apply the dimensional or 
to various practical problems. Included are studies into E 
after-effects of prefrontal leucotomy, into the employability o 
- mental defectives, into work adjustment and productivity of un- 
skilled factory workers and into the selection ofstudents and nurses. 
These studies extend our knowledge in two directions. They make 
use of populations—mental defectives, nurses, factory workers, 
teachers—which are quite unlike those on whom our dimensions 
were first established; the fact that very similar results appear "i 
these new populations to what we had observed in the pue 
groups lends strong support to the universality of the phenomena. 
And secondly these studies bring into the universe covered by our 
tests concepts and variables not previously dealt with. Produc- 
tivity, work adjustment, social relations and ability to get on with 
people, popularity—these are only a few of those used in these 
studies. Nor have we failed to follow the dictates-of the hypothetico- 
deductive method even when dealing with these severely practical 
problems. In each case a definite hypothesis was stated, and the 


investigation designed in such a Way as to provide evidence for or 
against the hypothesis. 


| this chapter a number of investigations are described in which 


(1) Post-operative Effects of Leucotomy 


Perhaps the most interestin 


g and important of the series of 
studies summarized in this cha: 


Pter is the investigation of the after 
effects of leucotomy, carried out by A. Petrie (1952). Its import- 


ance lies not only in the practical necessity of knowing just what 
sort of personality change may be expected to occur after the fibres 
connecting the frontal lobes with the thalamus have been sec- 
tioned; it sheds light on theoretical issues of fundamental interest. 
At the same time, this study illustrates very well a point on which 
stress has been laid before, namely, the importance of making fall 
use of the hypothetizo-deductive method in psyci:ological investi- 
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gation. Too much work has been done on the after-effects of 
leucotomy in a spirit of blind empiricism, of trying any test that 
came to hand that might be relevant; the almost completely 
negative results of this approach will be familiar to most readers, 
and are catalogued in Crown's very able survey of the field (1951). 
What is required, surely, is first of all an hypothesis, couched in 
operational terms, as to what after-effects would be expected, 
followed by an experimental design which permits unambiguous 
verification or refutation of deductions made from this theory. 
Even if unsuccessful, such a procedure would increase our know- 
ledge far more than any number of ad hoc empirical tests based on 
no clear underlying hypothesis. 

The results of the almost universal failure to make use of this 
method have been rather melancholy. Bronfenbrenner (1951) has 
stated the position admirably: ‘It is difficult for the would-be 
theorist to avoid being forced in orfe of two dissociated directions. 
If he covets his reputation as a scientist, he is under pressure to 
confine himself to the analysis of relatively simple phénomena 
where the variables are few, discrete, and susceptible to rigorous 
experimental control. The most significant aspects of human 
behaviour, however, are not likely to be found in this category, 
for they are characteristically elusive and multideterminate. To 
wrestle with these at a realistic level and at the same time to face 
up to the expectations and criticisms of fellow-scientists take more 
time, energy, patience, and self-integration than many able men 
command, It is far easier to remain free from such demands by 
doing one’s theorizing in a non-scientific context. As a result, it is 
perhaps possible to say—with only moderate exaggeration—that 
the study of human behaviour in America shows a bimodal dis- 
tribution with undisciplined speculation at one mode and rigorous 


at the other.’ This study will illustrate how it is possible to 


sterility 
through the use 


achieve scientific rigor, while avoiding rigor mortis, 
of the hypothetico-deductive method. 

The first two hypotheses advanced by Petrie are illustrated in 
Figure 33. Taking her cue from the isolation of the two personality 
dimensions of neuroticism and introversion-extraversion, she posits 
that (1) neurotic patients after leucotomy will show a decrease in neuroticism, 
and (2) that neurotic patients after leucotomy will show an increase in 
extraversion. An additional hypothesis relates to anether personality 
dimension, this time in the cognitive field, and states that (3) 


neurotic patients "fter leucotomy will show a decrease in intelligence. Tt 
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should be noted that these predictions apply to neurotic patients 
only; the changes to be expected with psychotics will be discussed 
later in connection with the experimental work of Crown ( 1950)- 

'The use of such terms as neurolicism, extraversion, and intelligence 
presupposes some sort of operational definition, as otherwise no 
testable deductions from these hypotheses are possible. Six tests, 
taken in the main from Dimensions of Personality, were used. to 
measure neuroticism: body-sway suggestibility, manual dexterity, 


EXTRAVERT 


NORMAL NEUROTIC 


ORIGINAL 
POSITION 


INTROVERT 
Figure 33 


smoothness of work-curve, self-rating scales, tempo of handwriting, 
and disposition-rigidity. The prediction made on the basis of the 
original hypothesis requires the following changes after leucotomy* 
decrease in suggestibility, improvement in manual dexterity, 
greater smoothness of work-curves, less self-criticism and fewer 
inferiority feelings on self-rating scales, greater speed of hand- 
writing, and less disposition-rigidity. 

Another set of six tests 
accuracy /speed, ratio, introp 
humour, verbal/non 
a preoccupation wit 


Was used to measure introversion: 
unitiveness, relative dislike of sexua 
-verbal intelligence test ratio, persistence, ап 

h the past rather than with the present or the 
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future. The prediction made on the basis of the original hypothesis 
requires the following changes after leucotomy: lower speed/ 
accuracy ratio, decrease in intropunitiveness, increase of sex- 
humour appreciation, lowering in the verbal/non-verbal intel- 
ligence test ratio, lessening in persistence, and a lessening in 
pre-occupation with the past. 

Regarding the changes in intellectual capacity, the Wechsler- 
Bellevue and the Porteus Mazes tests were used, as well as the 
‘Proverbs’ test from the Revised Stanford-Binet test. The direction 
of changes to be expected will be self-evident from the original 
hypothesis. 

Two separate types of operation were performed. The first of 
these is the so-called Posterior Standard Leucotomy, described in 
detail elsewhere (McKissock, 1943). The other type of operation, 
the Anterior Rostral Leucotomy, is an open bilateral operation, 
carried out through a pair of burr holes under direct vision. It is 
more anterior, and therefore presumably less damaging, than the 
Posterior Standard Leucotomy; full details will be found elsewhere 
(McKissock, W., 1949). Two further hypotheses were based on 
the differences between these two types of operation. In view of 
the fact that the Rostral operation is less far-reaching, it was 
posited (4) that the same changes would be found as in the Standard 
operation, but that their extent would be less extreme. The other deduction 
is based on the assumption that the great variability in the after- 
effects of the Standard operation may be due to operative vari- 
ability consequent upon lack of visual control, a lack not present 
in the open Rostral operation. It would seem to follow, if this 
reasoning be correct, that (5) there should be less variability in the 
after-effects of the Rostral operation. 

The patients operated upon were severe neurotics, primarily 
those suffering from obsessional symptoms and from anxiety; they 
would in the main fall into the category of *dysthymics' as described 
in Dimensions of Personality. The same principle of selection was 
followed in choosing patients for either type of operation, and 
there is little reason to suspect the group of patients who were 
given the Standard operation to have been different in any im- 
portant respect from the group of patients who were given the 
Rostral operation: 20 patients in the Standard group were tested 
before the operation, and retested three and nine months after the 
operation; 15 patients in the Rostral group were tested before the 
operation, and Zetested six months after the operation. Some 
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additional patients in the Standard group only received one retest; 
some additional patients in the Rostral group received. only па 
lateral incisions. Results from the main groups only are summarize 
here; a full report will be found elsewhere (Petrie, 1952). 4 

'Тһе results of the tests as applied to the Standard operation 
group of patients are given in Tables XLV, XLVI, XLVII; these 
figures, it should be noted, are merely a briefsummary of the many 
lengthy and detailed tables given by Petrie. Regarding the p 
dicted decrease in neuroticism, it will be seen that the direction © 
the change for all six tests is as predicted; this is true both for the 
three-months and the six-months retest. Most of the C. R.s are fully 
significant, and for every test there is at least one significant change 
in the predicted direction—either after three or after nine months. 
We may conclude that with respect to this dimension of person- 
ality, the original hypothesis is fully borne out. 


TABLE XLV 


CHANGES AFTER LEUCOTOMY: CRITICAL RATIOS 


3 months | 9 months 


(1) Neuroticism 
(a) Body sway suggestibility | 


2:33 2181 

(b) Manual dexterity | бл | 395 

(c). Smoothness of work curves | 2:34 | NS. 

(4) Self-rating scal_ | Ns. | 286 

(e) Tempo of handwriting | N.S; 2:43 

(7) Disposition-rigidity | rgo | 2:12 
1 2 


With respect to the Introversion-Extraversion dimension, the 


results are equally clear. Again all tests change in the predicte 
direction, and with one exception all 


; the changes are significant at 
a high level of confidence. (It would have been permissible in all 
cases to use the one- 
arbitrary statistical 
two-tail test in со 
Standard operatio. 
differences.) 

With respect to intelli 
The only test which does 
is the Wechsler 
and the Porte 
highly signific 


manipulation of data, however, Petrie used the 
nnection with computations arising from the 
n, thus reducing the chances of finding significant 


gence, again the figures are very cleat 
not show any declines after the operatio? 


us Mazes test all show the predicted decline at ? 
ant level. Certain points ought to эе borne in min. 


tail test. In order to avoid any possibility of 


Performance scale; the Verbal scale, the Full scale, | 
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here, however. It is somewhat artificial to divorce the cognitive 
functions from personality as a whole, and the fact that the ratio 
of verbal to non-verbal performance on intelligence tests is a good 
measure of introversion (Himmelweit, 1945) shows the close rela- 
tion which obtains between temperament and intelligence. The 


TABLE XLVI 
CHANGES AFTER LEUCOTOMY: CRITICAL RATIOS 
3 months 9 months 
(2) Extraversion-Introversion 
(a) Speed/Accuracy ratio 1:10 1555 , 
(b) Intropunitiveness 2:53 ТОЛИЙГ ар 
(с) Sex humour liking Significant * | Significant * 
(d) Verbal/non-verbal Intelligence test Significant * | Significant *- 
(e) Persistence 2:82 2:42 
(f) Preoccupation with past ə | Significant * | Significant * 


* Data in original significant but not expressed in terms of C4R. 


differential decline here found—marked with respect to verbal 
intelligence, non-existent with respect to performance—is exactly 
what would have been predicted on the basis of Petrie's second 
hypothesis. The Porteus test also involves personality factors out- 
side the purely cognitive field; according to hypothesis (2) speed 
and accuracy on this test should change, after the operation, and 


TABLE XLVII 
CHANGES AFTER LEUCOTOMY: CRITICAL RATIOS 


3 months 9 months 
(3) Intelligence 
(a) Wechsler Verbal 3:09 


481 309 

(b) Wechsler Performance 0:24. 1:04. (improved) 
4:02 
3:14 


(c) Wechsler Full Scale 223 
(d) Porteus Mazes | 767 


indeed both changes are observed (C.R.s 2:47 and 3:25 respec- 
tively for the three months’ retest). No figures are given for the 
Binet Proverbs test, but qualitative observation of responses leaves 
little doubt of considerable deterioration in ability to generalize. 
"These three tables then verify in almost every detail the pre- 
diction made, aud suggest very strongly that in this type of patient 
S.S.P. R 
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a marked change takes place with respect to his p Hes 
three dimensions of personality involved. Much 5 декат іп M 
tion is given in Petrie's book which supports this m d ^ 
we have no space to discuss it in any greater detail. т 1 E 
must now turn to the changes which follow after the open Ro 
eration. 

е эсе аге summarized in Table XLVIII; ын ко 
in the same direction as those reported in connection with 


Standard operation, and most of them will be seen to be significan 


Affecting any direct comparison is the fact that the «x a 
patients submitted to the two types of operation is палае dod 
Will be remembered that 20 patients had the Standard opera E 
done, while only 15 had the Rostral operation donc. When b 
qualification is borne in mind, there remains little doubt ын 
the non-cognitive changes observed after the Rostral operation | ha 
identical in direction, but smaller in extent, compared with er 
changes observed айсг the Standard operation. ‘Thus Lr E 
(4) is seen to be verified, and in so far as the results are Жү 23 
hypotheses (1) and (2) they must be regarded in the nature 2d 
replication of the first experiment, thus establishing our faith in 
reproducibility of these results on a much firmer basis. 


TABLE XLVIII 
CHANGES AFTER ROSTI 
(1) Neuroticism 


(0) Body Sway suggestibility 


RAL LEUCOTOMY: CRITICAL RATIOS 


(P) Manual dexterity 2:64 
(c) Smoothness of work curves N.S. 
(d) Self-rating scale 2°93 
(e) Tempo of handwriting 4°67 
(f) Disposition-rigidity r85 

(2) Extraversion-Introversion 
(a) Speed/Accuracy ratio N.S. 
(b) Intropunitiveness A 2:07 
(с) Sex humour likin Significant * 
(4) Verbal/Non-verbal Intelligence test N.S. 
(е) Persistence N.S. 
(F) Preoccupation with past 


Significant * 
(3) Intelligence 


(a) Wechsler Verbal 


1127 
(b) Wechsler Performance 1:65 
(c) Wechsler Full Scale 1-21 
(4) Porteus Mazes 0:33 


* Data in original significant but Dot expressed in terms of C.R. 
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This confirmation only applies to the non-cognitive changes, 
i.e. to changes along the dimensions of neuroticism and extra- 
version-introversion. Changes in intelligence level do not follow 
the pattern observed after Standard operations; the obvious loss 
found in patients operated on according to the Standard procedure 
is absent in patients operated on according to the Rostral pro- 
cedure. This result argues in favour of a specific location hypothesis 
with respect to certain aspects of intellectual functioning, and may 
thus explain the contradictory reports on intellectual after-effects 
of leucotomy which have characterized the literature (Crown, 
1951). While this finding thus opens up exciting new research 
prospects, it is not intimately connected with our main theme, and 
will not therefore be pursued further. 

One way of illustrating the fact that in the cognitive field the 
effects of the two operations are dissimilar, while in the non- 
cognitive field they are similar, is to calculate correlation coeffi- 
cients between the critical ratios of the differences in test scores 
following on the two operations. Comparing the first Standard 
operation retest with the Rostral operation retest, the correlations 
for cognitive and non-cognitive test С.К. are respectively — -400 
and + :211; comparing the second Standard operation retest with 
the Rostral operation retest, the correlations аге — :143 and 
+ 1360 respectively.! Thus after the longer period the changes in 
temperament and character had become increasingly similar in 
the two operations, while the dissimilarities between the cognitive 
after-effects on the two operations had decreased. (It is quite 
justifiable to use C.R.s in this fashion, although the number of 
patients in the two groups is different.) 

We may now turn to the fifth hypothesis, which used the obvious 


. differences between an open and a closed operation to predict 


greater variability of after-effects in the closed operation. To inves- 
tigate this hypothesis, the S.D.s of the differences in tests on which 
significant changes were found after both types of operation were 
compared. The results favour the hypothesis strongly with respect 
to measures of introversion-extraversion, but are ambiguous with 
respect to measures of neuroticism. We can only regard this 
hypothesis as partially confirmed, therefore, and must await a 
repetition of the experiment, possibly with larger numbers of 


1 The number of tests involved in these correlations is 19 214 32 respectively 
for the first retest, апо 19 and 41 for the second retest. The other tests included 
are fully described in Petrie ( 1952). ° 
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subjects, before deciding definitely either for or against it. It should 
be remembered, in this connection, that while Meyer (1949, 1950) 
and others have shown that there is considerable variation in the 
incisions made in the Standard operation, it has not yet been 
ascertained that the open Rostral operation does actually lead 
to more precise results. Possibly part of the variation in the cut 
may be duc to the elasticity of the brain tissue, and part to hemor- 
rhage and interference with blood supply. This is a complex 
problem, and no final conclusion can be claimed. 

In summary of this research, we may say that a large number 
of highly significant changes has taken place after prefrontal leuco- 
tomy, a finding which in itself contrasts markedly with the usual 
dearth of positive results in this field. These changes are in almost 


cated by the results obtained with the Rostral operation, although 
in a less marked manner. T 


in line with the original hy 
What more marked after th 
but although this result is 
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psychotic patients, retested three months after the original opera- 
tion (1951). 

Relevant results include a slight tendency towards lower in- 
telligence test scores. ‘Post-operative changes on tests of speed- 
accuracy, persistence, level of aspiration, and on the Rorschach 
"experience type" gave no consistent support to the hypothesis 
that a psychctic group becomes more extraverted after leucotomy.’ 
"There were no significant changes after leucotomy in the scores of 
tests designed to measure Eysenck's . . . factor of personality organ- 
ization. The general direction of the changes which occurred 
on tests of suggestibility, static ataxia and on the oscillation test 
suggested an increase in the “normality” of the group.’ 

When one is dealing with definite hypotheses, negative results 
may be as important as positive results. In terms of our conception 
of the psychotic axis as running orthogonal to the neuroticism and 
extraversion-introversion axes, it isquite easy to understand why 
results found on a group of neurotic patients cannot be duplicated 
on a group of psychotic patients; indeed, any other result would 
have thrown doubt on the general theory. Similarly, we may add 
to our positive forecast regarding the shift of psychotic patients 
on the psychoticism axis a negative one, viz. that no such shift 
‚ would take place in neurotic patients. Both predictions are easy to 
test, and refutation or verification should only be a matter of time. 


(2) The Employability of Mental Defectives ^ 


The work here described was undertaken to study the employ- 
ability of mental defectives; this severely practical aim has theo- 
retical implications of great importance, however, which were 
clearly brought out by the two investigators, J. Tizard and N. 
O’Connor, in a series of papers describing their investigation 
(19504, 1950b, 10506, 1951, Tizard, 1951, O'Connor, 1951). The 
present account is concerned only with that section of their data 
which is relevant to our general hypothesis of a ‘neuroticism’? 
factor; for more detailed descriptions the reader is referred to the 
papers quoted above. : 

It is widely believed that mental defectives, certified as suc. 
and segregated in special institutions, are characterized by 1.0.5 
well below the somewhat arbitary but widely accepted limit of 70 
originally suggested by Terman. Some writers have suggested 
lower limits than this, and most authorities would consider that 
factors other than the I.Q. should be considered in certification; 
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however, it is widely agreed that although individuals having 
of over 70 may occasionally be certified, only very excep ae 
circumstances would justify such a step. This general picture is ҮН 
in agreement with results obtained on 104 high-grade institutior 
ized male mental defectives, who were given 5 tests of ues 1 
intelligence. The tests used, together with the mean LOs орыш 
Бу the group, are given in Table XLIX; it will be M 
clearly that the average level of ability of this group is repr eme 
by an І.О. level of about 75. Thus the average score of this gr E 
is well above what one would normally have expected to be 2 
maximum score. Indeed, scores as high as LO. 120 and € 
were recorded by isolated individuals on the Koh’s Block Deng 
test, the Binet Vocabulary test, and the Porteus Mazes test! 


TABLE XLIX e 
Test ^ Mean LQ. 


'Koh's Block Design, Alexander Version 


75 
Progressive Matrices, 1938, untimed 75 
Binet Vocabulary Test 71 


Porteus Mazes Test, Vineland Revision | 83 
Cattell Non-Verbal Test, Form I.B.* | 73 


* S.D. adjusted to 16 from 25 as given by Cattell, so as to 
make the results comparable with other tests. 
Several differe 


Certification may be at fault, i 


: : A e 
Snostic testing, constitutes E 
nt. Even if we assume that th 


T 
E». 
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regression in assessing intellectual status and development is un- 
doubtedly a potent cause of aberrant I.Q. values in samples of 
high-grade mental defectives. 

However, in addition to these two possible explanations of the 
rather puzzling facts reported by Tizard, we must consider the 
possibility that certification is carried out on the basis, not merely 
of intellectual defect, but of a combination of mental defect and 
neuroticism. The child that is merely dull, with an І.О. of бо or 
65, may easily escape certification; the child that is less dull, but is 
also suffering from emotional instability, is far more likely to be 
found an unbearable nuisance by Society, and to be certified as a 
mental defective. On this hypothesis, ‘mental deficiency’ of high- 
grade defectives would be regarded as a combination of intellectual 
deficit and emotional instability, weighted differently by different 
certifying officers. High emotional stability could then counter- 
balance а low I.Q., while low stability would offset even a — 
moderately high I.O. 

On the basis of these considerations one would be able.to make 
two predictions regarding the employability of high-grade defec- 
tives. The first prediction would be that they would be rather 
variable with respect to neuroticism, the second that neuroticism 
would play a more important part than intelligence in their success 
or failure to adjust successfully to their employment. One further 
_ prediction may be made, namely that neuroticism in defectives can 
be measured objectively with the same tests used with normals. 
These three predictions, or hypotheses, form an interlocking system 
which can easily be tested empirically. 

The first hypothesis to be tested is clearly the one related to the 
applicability to defectives of the tests of neuroticism found to be 
valid in normals. The subjects used in this study were 104 high- 
grade mental defectives; they were consecutive admissions to 
Darenth Park, with cases with physical handicaps, ог with I.Q.s 
below 50 excluded. Mean age was 21 years, with an S.D. of 4:6. 
The following tests of neuroticism were used: (1) Body Sway test. of 
Suggestibility; (2) Heath Rail Walking test; (3) Manual Dexterity 
(U.S.E.S. tests M and N); (4) Finger Dexterity (U.S.E.S. tests O 
and P); (5) Speed on the Track Tracer test. Also included wasa 
measure of intelligence, namely the Matrices test. In addition to 
these tests, a four-point rating of stability was used, based on the 
social behaviour of the defectives tested. Each subject was rated by 
three judges, a p: ychiatrist and two psychologists, who were equally 
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familiar with the subjects. Intercorrelations of judges’ ratings 
averaged -6; a combined rating was obtained in which complete 
agreement of the three judges was required. The four points of the 
rating scale were defined in detail ( O’Connor, 1951); briefly they 
were: (1) Markedly stable, (2) Stable but immature, (3) Rather 
unstable, (4) Markedly unstable. Numbers of defectives in these 
four categories were respectively 45, 15, 29, and 15. 
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cism to the extent of -46. This is slightly higher than the value 
given for the correlation between intelligence and neuroticism in 


dynamometer grip at 2 
This type of test had alwa: 


» however, discrimination was very 
ncluded with the others in Table L 


f ing of the 104 defe 
or employability, was estimated Ъ 


of the scales shown in Table LI. 


ctives, their work success 
у а Social Worker on the basis 


| 
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TABLE LI 

Rating Scale N 

Conspicuous all-round success on daily licence 7 6 

Settled down well in first job in daily licence | 6 13 

Satisfactory on daily licence after not more than two 

replacements | 5 17 

Settled on daily licence after several trials | 4 зо 

Failed consistently on daily licence | 3 16 
Successful in institution work, but unlikely to be con- | 

sidered for licence | ° 4 

Unsuccessful in Institution workshop | 1 8 


This employability criterion was incorporated in Table XLIX, 
and it will be seen from Figure 33 that it had a factor saturation 
for neuroticism of -66, as well as a slight correlation with the motor 
co-ordination factor (-31). It appears, then, that our original hypo- 
thesis is borne out by the results; employability of mental defec- 
tives is forecast with considerable accuracy by neuroticisin tests. 
Employability is forecast by both factors in combination to the 
extent of a correlation of +73. Considering the unreliability of the 
criterion (no data are available on this, but a general knowledge 
of the many chance factors entering into a person’s success in 
employment under the general conditions of this experiment sug- 
gests an upper bound to reliability in the neighbourhood of ·8) this 
must be regarded as a remarkable success. The conclusion seems 
justified that neuroticism is a very important factor in the employ- 
ability of defectives. It will be seen from Table L that the influence 
of intelligence on employability is very much less important; the 
correlation between employability and score on the Matrices test 
is only :34, so that only about one-fourth as much predictive power 
is given by intelligence tests as by neuroticism tests. А 

Тһе social consequences of these findings аге considerable. 
It will be remembered that this group of 104 defectives was not 
selected but constituted successive admissions of physically not 
handicapped persons with I.Q.s over 50. Two-thir ds of this group 
was suitable for employment in the community, at wages averag- 
ing £4. 105. od., and going up to £7 оп occasions. A further 29 per 

„cent were suitable for institution employment. Only 8 per cent 


were unemployable custodial cases. Both society and the defectives 


themselves would benefit from the introduction of routine test- 


ing along the lines suggested of all high-grade defectives, and a 


в 
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vigorous policy of licensing and work-placement. It hardly шэн 
emphasizing, however, that all such work should be supervised zd 
controlled by fully qualified clinical psychologists, well acquain гн 
with the problems, both psychological and statistical, which ari : 
in mental testing, and specially trained in the administration B 

objective performance tests of personality. Untold harm could Ki 
done by the use of psychological tests in unskilled hands; e К 
itis only too obvious that the indiscriminate use of such tests as 1 3 
Binet by those whose background training has not include Ё 
considerable amount of mental testing and statistics is doing muc i 
harm to the whole mental testing movement. How much grea 
could be the harm resulting from the indiscriminate use of the шин 
more complex and difficult tests used in personality measurement: 


(3) Work Adjustment of Unskilled Factory Workers 


psychologist, and an е 
b 


ost stress will be laid on the psychometric aspects, 50 
that for the maj 
are indebted to Alastair Heron, 


24 


part of the general appr 


ец om worker representatives through the 
Works Committee, meeti 1 


including the Chief Shop 
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Meetings were also held with the men who constituted the experi- 
mental group, and an outline given of the aims and objects of the 
investigation. 

The experimental group selected consisted of grid-casters in 
the Moulding Department. No selection took place; all eighty-five 
men who constituted the entire population of grid-casters were 
approached. Of these, one refused to co-operate, one was trans- 
ferred to another factory, and three left of their own accord before 
the conclusion of the project. Thus the experimental group con- 
sisted of eighty persons. These were subjected to a variety of 
psychiatric, medical, and psychological procedures. 

(1) Height, weight, vision and hemoglobin measurements were 
made routinely, as were also lumbar pull dynamometer tests and 
the Sclineider test of cardio-vascular efficiency. Clinical examina- 
tion of special systems was performed when indicated or requested. 

(2) A medical and psychiatric interview was carried out by the 
psychiatrist, and ratings made on the basis of this interview and 
the measures described under (1) combined for: (а) Past*health, 
(b) Recent mental health, and (c) Recent physical health, using 
4-point scales; also rated were thirteen personality traits, on 3-point 
scales. These health and personality ratings were made by the 
psychiatrist quite independently, without the aid of supervisors’ 
ratings, productivity data, or absentecism records. 

(3) Most of the psychological tests given have already been 
discussed in connection with other researches. They included the 
following tests-of intellectual ability: The Dominoes test, which is 
probably the purest ‘g’ test yet devised, the U.S.E.S. Paper Form 
Board, the Mill Hill Vocabulary test, and a letter series test. Tests 
of temperament used were the following: Crown Word Connection 
List, Hand Dynamometer persistence, Cattell's ‘237° test of per- 
severation, a ‘Worries’ inventory, the U.S.E.S. Peg and Finger 
Boards, the Track Tracer, Static Ataxia, Leg Persistence, an 
‘Annoyances’ inventory, an ‘Interests’ inventory, the Rees-Eysenck 
Index of Body Build, a ‘Food Dislikes’ inventory, and a Level of 
Aspiration test. An individual Rorschach test was also adminis- 
tered, but results have not been analysed in conjunction with the 
other tests and will not be discussed here. Total time of administra- 
tion was in excess of two hours per subject. илэн 

Having described the predicting variables, we must now turn 
to the predicted variables, і.е. the criteria for this particular 


research. It is well known that in industry good criteria are hard 


e 
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to find, and much research was devoted to finding reliable and 
valid criteria of Work Adjustment and of Productivity. шт 
(1) Work Adjustment. А panel of six raters was ee is d 
consisting of the foreman, a former chargehand now in the ор i: 
mental office and in daily contact with all the men, and the Ө 
chargehands, who rotate independently of the three ресе 
shifts worked by the men. Two sets of ratings were obtained fro: d 
the raters, an interval of five weeks elapsing between tho M 
ratings. The method used is rather original, and is fully descri 
by Heron (1951). 


*Will you please listen carefully to the explanation which I Ы, 
to each of the six supervisors who are taking part in this section 
the research project. | | hom 

"The object of this exercise is to classify the eighty men w "e 
we have seen in the Mouldirg Department with respect to the 
apparent adjustment to the total demands of the job-situation. No 

"It s important to make it quite clear from the outset, firs вэ 
all, that these classifications will remain strictly confidential to t 5 
members of the Team; and secondly, that we do NOT want 3 
rating which is based solely or mainly on a man's rate of produc 
tion as compared with others. 

‘In front of you are three trays; in my hand is a set of cards, 
each card bearing the name and clock number of one of the eighty 
men who are taking part in the research. ` + 

"In а few moments I shall place a large card at the head of each 
ou an opportunity to read what they ин. 
€ two side ones describe extremely differen 
nding you one small card at a time. If yos 
se name and clock number is on the card і 
correctly described by EITHER of the two side trays, then place 


the description is too extreme 
then drop his card in the CENTRE tr. 


this first sorting, there will be very m 
than in the two s 


ay, marked ‘Average’. ын 
any more in the centre tray - 
r. Is that clear?” 


. , (PAUSE. ANSWER any QUESTIONS.) 


(PLACE LARGE CARDS AT HEAD OF TRAys. PAUSE TO LET 
HIM STUDY THE, DESCRIPTIONS. ÁNSWER ANY QUESTIONS.) 


Ei | 
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Ч shall hand you a fresh card every 5, seconds, like this.’ - 
(DEMONSTRATE WITH THREE BLANKS.) 
‘Right, let's begin, and keep up a steady pace.’ 


(When the first sorting is complete, take the cards from the 
“VERY WELL-ADJUSTED' tray and if they exceed TEN in number, ask 
the rater to look through them, and to pick out ‘the best 8 or 10’. 
Place rejects in centre tray; repeat for *BADLY-ADJUSTED' if neces- 
sary, asking him to pick out ‘the worst 8 or 10°. Now place second 
pair of large cards in side trays, take pack from centre tray in hands 
and say:) 


‘Now we are going to sort out the cards of the average men. The 
procedure will be the same as before, except that this time as the 
end trays are ло! such extreme descriptions, we can expect to have 
about a quarter of the cards in each of them, and about half in the 
middle tray. Right?’ н 


(REPEAT SORTING PROCEDURE.) 


(When the second sorting is complete, adjust the contents of 
the side trays to about 14-20 cards each if possible, but ро хот 
FORCE AN INCREASE OR DECREASE against ?he rater's selection if he 
seems satisfied.) 5 


MATERIAL ON LARGE CARDS 
Rating 
Thoroughly settled down 
Never have to worry about him 
No trouble to me at all 


Well-adjusted to the job First two 
s cards in 


end trays. 
Has never really settled down 

Always having to keep him in mind ^ 

One of my headaches 4: 
Badly-adjusted to the job e 

Has setiled down бейет than average 
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Seldom need to worry about him 
Not much trouble to me 


Fairly-well adjusted to the job 


Second two 
- cards in 
Has settled down less well than average ши 
Quite often have to keep him in mind 
4 Is quite some trouble to me 
Not very well adjusted to the job 
8 AVERAGE (Used in 
. middle tray 
on both 
sortings.) 


Product-moment correlations were calculated between each 
pair of raters on each occasion, as well as between the ratings 0 
each rater from one occasion to the other. The average inter- 
correlation between the six raters was 483, indicating a validity 
of the average rating of approximately -go (Eysenck, 1939). The 
average reliability of the raters was “781. A factor analysis of the 
table of intercorrelations was carried out, yielding a single genera 
factor which accounted for 50:5 per cent of the variance, 20! 
leaving no residuals greater than the S.E. of a zero correlation- 
'The total variance may therefore be apportioned as follows: Com- 
munality = 50'5 per cent, Specificity = 27:6 per cent. Unrcli- 
ability = 21:9 per cent. In view of the promihence of one general 
factor in these ratings, the twelve ratings for each individual rated 
were combined into a single average rating, which constitutes the 
Work Adjustment Rating defined as the measure of concern cause 
to the supervisors by the worker. 

(2) Productivity. A performance index was used, in which the 
number of srids produced, multiplied by the agreed rate of pro- 
duction expressed as a number of ‘standard minutes’ per hundreds 
is divided by the actual clock minutes worked, thus showing the 
value of a worker’s-time in terms of his actual production. Various 


-————————" 
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calculations showed that this index, which had been in use for a 
long time in connection with a wage incentive system, could be 
regarded as a reasonable one, within the limits set to high indi- 
vidual production rates by the general feeling of solidarity in the 
workshop. As is well known, fear of ‘overproduction’ resulting in 
unemployment, opposition to high taxation, which affects high 
earnings due to high production, and tensions created by out- 
producing less able members of the group all tend to impose a level 
of uniformity on production which keeps it below the maximum 
possible without undue exertion. All variations in productivity 
are therefore considerably attentuated, and much smaller than 
they would be if each individual were working ‘full out’. The 
difficulties introduced into the analysis by these extraneous factors 
are likely to reduce any correlations which might be found between 
productivity and other factors almost to vanishing point. 

Product-moment intercorrelaticns were calculated between all 
the variables. In view of the fact that age correlated significantly 
with several of the variables, partial correlations were calculated 
and the resulting table of correlations (with age partialled out) 
submitted to a factorial analysis. The results ofthe centroid analysis 
are given in Table LII, together with the rotated solution.! To 
facilitate interpretation, а diagram has been prepared of the 
position of the various tests on the plane formed by factors I and II 
(Figure 35). It will be seen that factor I is characterized by the 
four intelligence tests (Dominoes, Letter Series, Vocabulary, and 
Paper Form Board), and by two speeds of writing tests, which in a 
group of this composition are known to correlate quite highly with 
intelligence. No other test has a high positive correlation with this 
factor, which is clearly identifiable with intelligence. — 

The second factor is characterized by all those tests which had 
been found previously to be good measures of neuroticism: many 
worries, many annoyances, high static ataxia, poor ресе (оп 
both tests), many food aversions, neurotic score on Word e 
nection List, poor mental health rating. Poor job adjustment also 
has a projection on this factor, which encourages us to interpret it 
with a fair measure of confidence as neuroticism. It should be 

1 The rotated solution shown here differs slightly from that given by Heron 
himself. This difference, however, is not material to the argument. > 

2 The Word Connection List, a measure of neuroticism, раз a high negative 


saturation in intelligence in this analysis. Previous work has not disclosed any 


reason for this surprisingly high correlation. ә : 
8.8.Р. 
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pointed out, however, that there are a few anomalous findings 
which contradict previous work, and which can at the moment 
only be ascribed to the fact that this group is very different in 
composition and background to any of those hitherto studied. Thus 


INTELLIGENCE 
DOMINOES 


LETTER SERIES 


* WRITING 72577 


VOCABULARY е © PAPER FORM BOARD 
.5 2 


eFINGER DEXTERITY 


, MENTAL HEALTH RATING 
"NEUROTICISM? © WORRIES 


ғооо POORTOB oSTATIC ATAXIA 
AVERSIO! ADJUSTMENT 
ASTONS MENS ANNOYANCES 
ONON-PERSISTENCE (GRIP) 

° © INTERESTS 
NON-PERSISTENCE 
(LEG) 

е 


^ 


e WORD CONNECTION LIST 


5.—Note: some items near origin omitted 


Figure 3. 


the item ‘Many interests’ has a positive correlation with neuroti- 
cism; eurymorphic body-build, and good finger dexterity, also 


show positive correlations with neuroticism, although at such а 
ght be responsible for thése find- 


low level that sampling errors mi, resp le 
ings. However, it would not be wise to dismiss findings зон n 
hypothesis in any cavalier manner, and it may even be one o the 


е 
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most fruitful ways of developing our knowledge in this field to 
follow up discrepant results and attempt to find an explanation 
for them. 

The third factor is much more difficult to identify, although 
the fact that one pole is characterized by low persistence, many 
food aversions, high score on the Word Connection List, and speed 
of writing suggests that possibly we may here be dealing with the 
extraversion-introversion factor, in somewhat attenuated form. 
The tests mentioned characterize the prototype of the extravert, 
the hysteric, as opposed to the prototype of the introvert, the 
dysthymic, and although not much reliance can be placed on this 
identification it is well in line with previous work, although here 
again by no means all the individual tests can be said to confirm 
the suggestion made. 

Factor four is rather easier to identify; it is characterized essen- 
tially by manual and finger dexterity, speed in carrying out à 
co-ordination test, strong grip, speed in writing, and other tests 
indicative of speed and co-ordination. In a job depending so much 
on motor co-ordination, it is perhaps small wonder that ‘poor job 


adjustment is negatively correlated with this factor, and ‘high 
productivity’ positively. 
Certain interestin 


to expectation: individuals with a ‘good’ rating ОЛ 
high neuroticism scores on the 


» thus arousing supinus 
Class population not entirely 
unemployment and |e -offs, when any 


personal deficiency Was something to be hidden from the employe’ 


‘in one general factor. The data are give 
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or what might be considered his representatives. Also, records of 
past work-history and the like would of course have contaminated 
the psychiatrist's rating, which was to be correlated with items of 
this very kind, and consequently no information on these points 
was made available to him. It is not surprising that under these 
conditions, so different from those under which he is accustomed 
to exercise his skill, the psychiatrist's rating contributed little to 
the prediction of either criterion, or to the measurement of any 
factor. 

The supervisor's rating of ‘poor job adjustment’, on the other 
hand, shows high saturations on the neuroticism factor (in the 
expected direction), and also on the motor speed factor. Produc- 
tivity shows high saturations only with the motor speed factor. 
It is interesting that intelligence has no saturations of any size 
on either of the two criteria; it is perhaps less surprising that 
introversion-extraversion should be neutral with respect to work 
adjustment and efficiency. 

The data quoted above show that (1) personality tests of the 
type advocated in this book can be given effectively in an industrial 
situation; (2) they show patterns of personality organization in 
unskilled workers very similar to the patterns shown by other 
groups hiterto studied; (3) they are applicable in conditions which 
severely restrict the proper functioning of psychiatrists; (4) they 
have acertain validity in predicting work adjustment and efficiency, 
even under conditions which make the assessment of these two 
criteria rather difficult. 

In support of points (1) and (2), another study may be men- 
tioned which was carried out on yet another group of subjects 
differing in many important respects from those studied previously. 
As part of an enquiry into miners’ rheumatism, Miss Braithwait 
administered the following five tests of neuroticism to a group of 
colliery workers, consisting of 84. miners, 45 maintenance workers 
and 45 office-workers: (1) Word Connection List; (2) Maudsley 
Medical Questionnaire; (3) Leg persistence test; (4) Annoyances 
test; (5) Finger dexterity test. These tests were intercorrelated by 

lastair Heron, and a factorial analysis performed which resulted 
n in Table LIII; it will be 
seen that all the saturations are positive and in the expected direc- 
tion; the saturation of the persistence test is as high as 5510, that of 
the Maudsley Medical Questionnaire 419, while the Annoyances 
test has the lowest discriminatory power (225). Multiple R, indi- 


” 
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“cating the accuracy with which these five tests, suitably weighted, 
would measure the general factor they define, is :686; it must be 
realized, of course, that this value is subject to shrinkage duc to the 
well-known tendency of R to capitalize on chance errors of measure- 
ment. Nevertheless, the results strongly support the view that the 
tests‘used measure one underlying factor of neuroticism, even in 
a group of subjects very unusual in psychological enquiries. 


TABLE LIII 
N =1ў4 SE.zeror = :076 


1 2 3 4 5 Ay | ki? 
| | | 
n +128 | +086 | 4-040 | +-148 | :333 | i11 
1 | Residuals —7012 | —-087 | —-035 | +-064 
r +221 | +139 +013 | зад | 7176 
2 | Residuals +7004 | +045 | —-o92 


+103 +156 +519 -269 


T 
3 | Residuals —o14 | 4-036 


r —'оо9) | + '051 
4 | Residuals - ‘obs з 
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1. Word Connection List 

2. Maudsley Medical Questionnaire 
3. Leg Persistence Poor Б 

4. Мапу Annoyances 

5. Poor Finger Dexterity 


Rasusracron = :686 


(4) Student and Nurses" Selection 


In a review of the literature on student selection by means of 


psychological tests, Eysenck (19476) pointed out the promising 
start which had been made by Munroe (1945) and others in 
attempting to use personality tests of а non-c 
Cast success at University, and maintaine 
spread use of tests of this typ 
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gate this hypothesis, and although results obtained so far are less 
direct and less convincing than those related to other hypotheses 
discussed in this chapter, they are presented here for the sake of 
completeness. 

The first study to be mentioned was carried out by Petrie 
(1948) on 49 male and 8 female students at a Medical School. The 
necessity of using group tests eliminated ab inilio most of those tests 
which in the past had been found to be good measures of neuroti- 
cism, and left three measures which with some misgivings were - 
applied to the students. These three measures were: (1) Word 
Connection List, (2) Maudsley Medical Questionnaire, and (3) 
the Index of Inaccuracy. This index, originally used as a measure 
of neuroticism by Himmelweit (1950), was calculated by dividing 
the number of false answers in all the cognitive tests by the total 
number of answers attempted, and was hypothesized to be cor- 
related with neuroticism in terms of the theory that emotional 
maladjustment interferes with smooth mental functioning. Cor- 
relations between the Index of Accuracy and the two other tests 
were significantly positive for the students investigated (r = -43 
and +40 respectively), and we may therefore regard the hypothesis 
that the Index of Inaccuracy is a measure of neuroticism as sup- 
ported. In addition to these three neuroticism tests, several cogni- 
tive tests were given, including tests of V. ocabulary, classification, 
rote memory, sentence completion, fluency, decoding, and form 
perception: (The fluency test might of course also be regarded as a 
personality test, in view of the fact that low fluency has been shown 
to correlate with neuroticism (Eysenck, 1947) and other traits.) 

To obtain a criterion superior to the usual one of examination 
results, two independent judges were chosen by the Dean, who 
rated the students on a five-point scale with respect to their medical 
ability. The combined rating was found to have a reliability of -8o. 
Correlations of the neuroticism measures with the rating were all 
in the expected direction, but disappointingly low. For the two 
verbal tests (Maudsley Medical Questionnaire and Word Con- 
nection List) correlations were — ‘16 and — :08 respectively; for 
the Index of Inaccuracy the correlation was — :27, and for the 
Fluency test it was :29. Except for the latter two tests, the correla- 
tions are too low to be of any practical usc. A multiple R of -63 was 
found from the battery as a whole, excluding sentence completion 
and Word Connection List; this R is of course a slight overestimate 
in view of the well-known fact that multiple R tends to capitalize 


. 
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on chance errors. However, later work done with these tests has 
shown that the correlations observed in this sample are typical of 
other groups as well, and that a predictive accuracy of between 
:5 and -6 can easily be reached. Given the present selection ratio 
obtaining in medical schools, it can be shown that by the use of 
tests the failure rate could easily be reduced from its present level 
of 18 per cent to between 1 and 2 per cent. Similarly, the number 
of students rated ‘very good’ could be increased from the present 
level of 10 per cent to between 30 and 40 per cent. While non- 
cognitive tests did not contribute to any considerable extent to this 
prediction, the discovery that the Index of Inaccuracy is a measure 
of neuroticism must be counted as a valuable addition to our 
knowledge of the operational meaning of this term. 
^ Another attempt to use non-cognitive tests was made by 
Himmelweit (1950, 1951), though on a rather larger scale. This 
study is of particular interest because it contains a comparison їп 
predictive accuracy between several different types of information, 
such as iaterviews, cognitive tests, temperament tests, biographical 
‘information, and achievement tests. In scope and thoroughness, 
this study is without doubt the outstanding British contribution to 
the experimental investigation of student selection. + 
Students tested were first of all submitted to the usual admis- 
Sion procedure, which consisted of an interview, a general essay 


to be subsequently correlated with the candidate's paper qualifica- 
tons and Entrance Examination results. Nevertheless, it is desir- 


being given to the following factors: 
(а) General intelligence, 
(0) Previous education, training and experience 
(c) Interests and motivation. 5 
(d) Personality and character. 3 
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(3) "Interviewing Boards are asked to make a single general 
assessment of the candidate's claims, not to sum up the results of 
Separate assessments under each of the above headings. 

The Boards were asked to make the overall assessments on a 
nine-point scale. 

Students were tested with a battery of cognitive tests, similar to 
that used by Petrie, but more extensive. They were also given a 
Set of general knowledge tests, and tests measuring their aptitude 
1n reading Tables, Charts, and Graphs. A detailed biographical 
questionnaire was also completed. Last, the following tests of non- 
Cognitive personality qualities were given: (1) Shipley Inventory, 
Form C. (2) Ranking Rorschach test, an adaptation of the Har- 
tower-Erickson Multiple Choice Group Rorschach described by 
Eysenck (1947). (3) Level of Aspiration test, in connection with a 
cancellation test. (4) Level of Aspiration test in connection with a 
decoding test, (Tests 3 and 4 are attempts to make use of the level 
of aspiration technique in a group situation.) (5) Speed test— 
decoding. (6) Speed test—cancellation. (7) Index of Inaccuracy. - 
(8) Introversion test. (This consisted of the difference in standard 
Scores between the vocabulary and the paper formboard, test, а 
Score Which had been shown previously to be diagnostic of intro- 
Version (Himmelweit, 1945).) 

The criterion against which tests, interviews, and other data 
еге validated consisted of the Intermediate and the Finals exam- 
inations, An attempt was made to assess the reliability of these 
examinations, using Guilford's equation 205 (1936) in order to 
obtain a minimum estimate of the reliability of the constituent 
part Measures, and using the Kuder-Richardson Е ormula (1937) 
to obtain а minimum reliability of their sum. Average estimates 9s 
examination reliability are -74 for the Final examination, and ‘8o 
for the Intermediate. While this is a minimum estimate, an applica- 
Boh of Eysenck’s formula for estimating the true reliability of the 
ratings of judges (1939) gave figures very similar to the above, 
Which may therefore be accepted as being reasonable estimates of 
E € true reliabilities. The papers in intermediate and final examina- 
tons Were correlated for a set of 245 students, and the resulting 
matrix factor-analysed. A general factor, accounting for 37 per 
үш of the total variance, was first extracted, followed by a bipolar 
actor, accounting for 6 par cent of the variance, which divided the 
Papers into two sets—those given in the intermediate, and those 
Siven in the final examination. It is clear that what is common to. 
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the two examinations is more than six times as important as what 
is specific to each. A 

This finding suggests the possibility of correlating the two 
examinations themselves in order to find a maximum possible pre- 
diction—it is very unlikely that a test can predict an examination 
better than another examination very similar to the one serving as 
criterion. This correlation turned out to be very slightly in excess 


of 6o. The lowness of this figure reflects in part the unreliability of | 


the two examinations; when the observed correlation is corrected 
for attenuation, the corrected value approaches -80, which ва 
better estimate of the true correlation between the two examina- 
tions, assuming both to be perfectly reliable. 

Test prediction of examination results is inevitably lower than 
the observed inter-examination correlation, and amounts to 46 
for the intermediate examination and :55 for the final examination 
(n = 232 for the former, and 114 for the latter).! Correction for 
lack of reliability in the criterion would raise both correlations 
approximately to the -6 level, but of course this figure is of purely 
theoretical interest. Of more interest is the finding that, using the 
obtained correlations only as indicators of predictive success, it can 
be shown that the use of tests as selection criteria would reduce the 

. number of failures from over 20 per cent to something like 5 Рё 
cent, under the selection conditions actually obtaining at the same 
time. s 

Having noted the general results of the testing procedure, We 
may now return to the non-cognitive tests. Some of these, like the 
Inventory and the Group Rorschach, failed to give significant 
correlations. The two level of aspiration tests gave results in the 
expected direction, i.e. showing greater failure rates among thos¢ 
having high aspiration Scores; correlations for the Aspirations test 
(cancellation) „апа Aspiration test (decoding) were — ‘20 an 
— 11 (Final and Intermediate examination respectively), 4? 
— '15 and — -10. The Index of Inaccuracy again gave the best 
resulf, showing correlations of ‘51 and -26 with the two examina- 
tions. While because of its mode of derivation this Index correlates 
with the tests of ability from which it is derived, these correlations 
are too low to account for more than a small proportion of its 

1 Applicatic.... 
to another batch 
usual decline in 
to new groups. 


of the regression weights derived from this sample of studena 
gave a correlation of -42 with final success, thus showing ү D 
predictive accuracy associated with multiple R when арр He 
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predictive variance. The speed score on the decoding test gave 
significant prediction, while the speed score on the cancellation 
test failed to do so; all these correlations are too low to be of any 
interest. 
. Аз would have been expected from the general hypothesis link- 
ing Inaccuracy and high Level of Aspiration scores with neuroti- 
Cism, these tests intercorrelate positively. The two Level of 
Aspiration tests intercorrelate -47; Inaccuracy correlates -20 and 
‘09 with the other two tests. While these correlations are very 
small, they are derived from tests which are themselves very 
unreliable, and if individual tests had been given higher correla- 
tions would almost certainly have been found. 
- We may now turn to a comparison of the test results with the 
Interview results. In the first place, correlations of interview ratings 
With all tests of intelligence are insignificant; indeed, the majority 
are negative. If the tests are accurate measures of intelligence, the 
Interview would appear to tend to give preference to the duller, 
rather than to the brighter students. In the second place, the 
interview shows negligible correlations with the other Entrance 
Examinations (r = — -o2 and 10). Thirdly, the interview pre- 
dicts examination success to the completely insignificant extent of 
07. It can be seen that although the non-cognitive tests used in 
шш Investigation did not predict success with anything like the 
accuracy one might have hoped for, ‘nevertheless they were 
markedly superior -to interviewing procedures of the kind des- 
ae This failure of the interview is only one of many instances 
showing the impossibility of achieving reliable and valid predic- 
рош 9n the basis of subjective ratings, personal impressions, and 
clinical insight; many other examples have been discussed in an 
Carlier chapter. However rudimentary our objective methods of - 
Personality investigation, and however difficult their application 
T the Broup situation—even now they can be said to give results 
et in reliability and validity to the interview. 1; 
in t should be borne in mind, in assessing the contribution of the 
terview, that the interviewing board had at its disposal the paper 
qualifications ang background of the candidate. American experi- 
се has shown that these data alone would normally give pre- 
CN considerably higher than those made by the Board. It 
“тусад M ие 
If t gative in sign, as we l П 
€mperamental factors play а part in.the achievement of 
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University students, it would appear even more likely that suey 
factors would influence the work of nurses and others who are in 
constant contact with people of one kind or another, and к. 
academic achievement is only in part related to their professiona 
proficiency. An experiment to test this hypothesis was carried Ой 
by Petrie and Powell (1950), who tested 126 nurses at опе oft € 
well-known London hospitals. Petrie paid particular attention to 
the question of a suitable criterion, as the choice of a criterion is ОЁ 
the utmost importance in all prediction testing. 8: 
A special rating scale was devised, based on available English 
and American scales intended for nursing and allied professions. It 
asked for a 5-point rating on each of 18 personality and ability 
traits. The rating was carried out after the nurses had been in 
training in the hospital for at least eighteen months. Each nurse 
was rated by three independent judges who knew her well. The 
average intercorrelation between these three judges (matron, ward 


sister, and sister tutor) was 649. A total rating was derived for each 
nurse, consisting of the 


on the eighteen traits., 


(1) Knowledge of 
nursing skills, 

(2) Ability to ada 
Imagination; foresi 
Situations. 

(3) Ability to rise to th 


underlying principles of nursing practice and 


pt these to the individual needs of the patient. 
ght; ability to anticipate requirements of new 


€ Occasion in emergency. Initiative; 
to difficult situations. 


anize and time duties successfully. 
Management of own work and others, 


their relationship to social attitudes and ability to deal with other 
human beings. These traits are: 2 й 


(1) Co-operation with other members of the ward team. Influ- 


sum of the ratings given by the three judges | 


-— 
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ence on associates and juniors ; relationship with other members of 
the Staff. : 

(2) Satisfactory relationship with the patients. Ability to 
gain their co-operation. Patience; understanding; kindness and 
sympathy. 3 

(3) Satisfactory attitude to patients’ families and visitors to the 
ward. Thoughtfulness; kindness and tact; courtesy. 

(4) Loyalty to the standards of the Training School and the 
Hospital. Co-operation with authority; ability to accept criticism. 

It appears, therefore, that our hypothesis regarding two quite 
different types of demand being made of a good nurse is justified. 
One demand is that she should have enough mental ability to cope 
With her work; the other that she should have the kind of tempera- 
ment that allows her to make the maximum use of her ability in 
this profession. The two sets of ratings listed above as having the 
highest positive and negative saturations with the bi-polar factor 
Were summed for cach nurse, and the totals, representing respec- 
tively intellectual and social relationship traits, were correlated 
With the tests administered to the nurses. ? 

Correlations of the tests used with the total rating criterion are 


‘given in Table LIV. It will be seen that for the cognitive as well as 


і ТАВІЕ ШУ 
List of Tests and their Correlation with the Criterion: r 
5: Accurate clerical observation. Minnesota Clerisal Test. Number : 
of mistakes in comparison of figures. '325 
2. Accurate clerical observation. Minnesota Clerical Test. Number ; 
of mistakes in comparison of names. E M 


3. Manual dexterity. Average score on O'Connor Tweezer Test. М 
4. Persistence at a task. Productivity in word building test. -263 
5. Persistence at a task. Time spent on word building test. :156 & 
6. Kent-Shakow Performance Intelligence Test. Intelligence scorc. 173 

7. Kent-Shakow Performance Intelligence Test. Number of mis- 

taken moves, 

8 on-verbal Intelligence. Penrose Pattern Perception Test. 2 
9. Maudsley Word Association Test. ‘Neurotic’ score. E К 
10. Adaptation of Interest Test Pressey X-O. 27 


—:240 * 
"209 * 


11. Adaptation of Annoyance Test Pressey X-O. 52533 
12. Verbal Intelligence. Mill Hill Vocabulary Test. — | 
13. Speed Accuracy Preference. Number of mistakes in trial on ө е. 


track tracer whe is stressed. 
14. Concentration па шау 8 an 
15. Istractibility Test. Number of mistaken answers. =ч 
Strength of maximum grip on the Dynamometer. 3 142 
* Starred correlations are significant at the 5 per cent level. Most of Em 
Others are in the predicted direction and would be significant if the one tai 
test were applied, .. 
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for the non-cognitive tests all the correlations are in the expected — 
direction, and that most of them are significant. A multiple R of 
above -6 was obtained from a combination of 12 of these tests. 
Correlations of selected tests with the ‘intellectual ability’ and the . 
‘personal relationship’ criteria are given in Table LV. It can. be 
_зеёп that some of the tests are more closely related to the *ability 
criterion, others to the ‘personal relationships’ criterion. For 
example, both the non-verbal and the verbal tests of intelligence 
are closely related to the ratings on ‘ability’, but not at all to those - 
on ‘personal relationships’. The score on the Word Connection. 
List, which is a neuroticism test, is more highly correlated with the 
‘human relationships’ criterion than with the ‘ability’ one. On the 


TABLE LV 


CORRELATIONS OF INTELLECTUAL ‘ABILITY’ AND ‘PERSONAL 
RELATIONSHIP’ TRAITS WITH SOME OF THE TESTS USED 


Р Intellectual | Personal. 
, ility lationship 
т. ZEE Lu 
= cop — — мо. Ч 
O'Connor Tweezer. Average number on two trials --:286 bes 
Word Building. Number of words produced +302 +13 
Kent-Shakow Form Boards—number of mistaken P 
moves —:906 (185 
Репгоѕе Pattern Perception Test. Number correct +270 FPE 
Maudsley Word Connection List. Neurotic score — 160 RI 
Educational Status La | +369 +131 
Mill Hill Vocabulary Test | +7269 d 
Nursing Examination—theoretical 4-482 tus 
Nursing Examination—practical +358 +:863 28 


manual dexterity test, which we have shown elsewhere to be а E 
of neuroticism as well as a test of ability, scores appear to be relate ; 
to both types of criteria. It is also in line with our hypothesis that 
when the examination marks of the nurses are correlated with the 
two criteria, the theoretical examination is found to correlate 482 
with the ‘ability’ criterion, but only -152 with the ‘personal rela- 
_ tionships’ criterion, while the practical examination correlates 
equally well with both criteria (т = -358 and -363 respectively): 
The outstanding results of this experiment are (1) the relatively 
small role played in nursing efficiency by intelligence (particularly 
as measured by verbal tests—correlation of Vocabulary with the 
total rating was not significantly different from zero), and (2) the 
relatively important role played by temperamental factors, 48 


e 
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measured by various tests of neuroticism. None of the zero-order 
correlations are very high, but in combination they predict effici-- 
ency in nursing better than a battery of purely cognitive tests. 


(5) Sense of Humour and Popularity in Teachers ` 


So far in this chapter, we have discussed the position in factorial. 
Space of concepts such as employability, job satisfaction; job effici- 
ency and others having occupational reference. The same technique 
of, dimensional analysis, of course, can be applied to concepts 
having no such reference, and as an example an experimental 
Study of the position of ‘sense of humour’ with respect to intel- 
ligence and neuroticism will be quoted. This investigation, carried 
Qut by Е. Loos (1951), made use of several types of test to measure 
five Separate meanings of the term, ‘sense.of humour’. 1 
а (1) Sense of humour may be looked at from the point of view of 
a сабац a person is considered tb possess this trait if his appre- 
on of the relative funniness of jokes, films, events, etc, agrees 
ma 5. own, or with that of the majority. (2) Again, a person 

Y be considered to possess ‘sense of humour’ if he appreciates a 
arge number of jokes, witticisms, etc., or laughs a great deal about 
кашу things, irrespective of the order of funniness into which he 
Ns Put these jokes, events, etc. (3) Alternatively, sense of 

E Our may be looked at from the point of view of creation; a 
ә ice Considered to possess this trait if he is constantly making 

other таулар attention to amusing features of the situation, or m 
фу n creating merriment. (4) A. fourth method of looking E 
the umour would emphasize the 'social stimulus quality o 

Person, and rely on ratings by others of his ‘sense of humour $ 
(5) Lastly, sense of humour may be defined in terms of self-ratings. 
These five definitions may give rise to tests which are highly сог- 
related, but it is quite possible that they may point to quite un- 
related aspects of personality. 

The following tests were included in the research to measure 
the five aspects of ‘sense of humour’. (1) A Limerick Ranking test, а 
in which twelve limericks have to be ranked in order of funniness; 
the score is the amount of agreement with the average ranking of 
the whole group. (2) Limerick Liking test, in which the subject has 
to indicate how many of the limericks he considers funny; this 
number is his score. Both these tests were taken from an earlier 
Paper Бу Eysenck (1643). (3) With respect to the ‘creative humour’ 
aspect, two tests were included. In the first of these, captions had 


id E 
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to be written for cartoons from which the original captions had 
been eliminated; in the second, social situations were outlined to 
the subjects who had to find an amusing ending for each situation. 
Scoring of these creative efforts was carried out by judges who 
did not take part in the experiment. The validity of the average 
of twenty judges’ ratings, using a formula developed elsewhere 
(Eysenck, 1939), was found to be in the vicinity of -90, 50 that 


+ . d р r 
scores on these two tests are reasonably objective. (4) Social humour 


rating. This consisted of an averaged rating of the subjects’ Ёо 
of humour, made by her colleagues. (5) In addition to these Ес 
and ratings, each subject was asked to give a self-rating of her ow 
sense of humour. In addition, a ‘popularity rating’ was obtaine' 
for each girl. ! 
To assess the position of each girl оп the intellectual dimen gn 
Thurstone’s Primary Mental Abilities tests were given to the gir'5; 


1 : Ч aber, / 
we thus obtain scores on thé space, verbal, reasoning, Dum? ^"^ 


and word fluency factors. To assess neuroticism and other poa 
temperamental factors, a group Rorschach was given," as M likes 
the Word Connection List, and the Worries, Likes, and Disi a 
tests mentioned earlier in the book. Also used was the Rosenzwe® 
Picture Frustration test, scored for Impunitiveness and for Extr 
punitiveness minus intropunitiveness. f with 
The subjects were 76 girls in a teachers’ training college, а 
an average age of 19. Scores on the nineteen tests used were 11 4 
correlated for the whole population, and a factorial analysis pE 
formed. Four significant factors were extracted. Factors Ian 
are plotted in Figure 36. The five Thurstone tests define fees 
as an intelligence factor; the four neuroticism tests (Word A. 
nection List, worries, word likes and dislikes) define the d 
factor as one of neuroticism. The results threw some interest ti- 
light on the relation of sense of humour to intelligence and асои 
cism. Let us take the Limerick Ranking test first. Agreement MET 
the average here is indicative of lack of neuroticism, as might ha i 
been expected; the slight positive saturation of this test with inte g 
ligence is also not contrary to expectation. The Limerick Dikine 
test is also correlated with neuroticism, in the sense that the mor 
stable girls tend to like a larger number of limericks. This test has no 
appreciable saturation with intelligence. We may conclude tha 
1 The Rorscl ach was scored for neuroticism ‘by means of the sign’ method: 


Individual correlations were calculated between sensc"of humour tests and the 
various Rorschach categories; these correlations were all insignificant. 


25. Needle-threading test. 


threading test. 


26. Close-up view of needle- 


27. Agility (water-carrying) test. 


ь X 


сг 
ü 


28: Kretschmer abstraction test. 
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| the more stable subjects like more limericks, and agree in their 
preferences more with the average judgment, than do the less 
stable girls. 


NEUROTICISM 


WORD CONNECTION 
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\ 5 
5 © ovsuines 
ике (Reversen) -4 © worries 
3 
LIMERICKS 
UKED (REVERSED) 


Ш 


CAPTION WRITING 
CARTOONS) 


POPULARITY 
© 


SPACE 
REASONING 
©, 


"3 INTELLIGENCE 


E m" =] Orr 2 o> РЕЗ: 


NUMBER [o] 


кокьс Ён 
NEUROTICISM Д.р m VERBAL 
. COMPLETION 
(HUMOUR SITUATIONS) 
| © 

2 LIMERICK 
RANKING 

- 3 SOCIAL 

© HUMOUR 
RATING 
9 

-4 SUBJECTIVE SENSE 


ок HUMOUR RATING 


Figure 36 


umour tests, it will be seen that 


As regards the two creative h 1 : } 
th intelligence, as might have 


they both have slight correlations wi 2 
been expected. Their saturations with the neuroticism factor are 


| . smalland oppositeinsign to'each other, so ару само ышану 
relation between neuroticism and creative humoür as measured 
here. This lack of relation may be a mere artefact, due to the 


imperfection of the tests used. X 128 
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- The social humour rating; and the subjective sense of humour - 
rating, both show marked saturations with the neuroticism factor, 
in the expected direction, and fail to show correlations with the 
intelligence factor. It is interesting in this connection to see that 19 
the popularity rating is unrelated to either neuroticism or intel- 
ligence; in other words, an unstable girl, or an unintelligent one, is 
just as likely to be popular as a stable, or a bright one. But an 
unstable girl is not likely to be thought of as having a good ‘sense 
of humour’, nor is she likely to consider herself as possessing this | 
. trait. This agreement between sel judgment and poor rating is 


- 


"> surprising in view of the frequent claims made in the literature — 


- . that few people admit to having a poor sense of humour; perhaps. 
i5 "the imperative demand for this quality in the American culture is 
` more pressing than it is in England! 2 


~ 


-  -.. A.third factor was clearly identified by the popularity rating, — 


he only exception to this rule, inter- 

S. + H . A 

estingly. enough, is the ‘social stimulus value? concept of sense of 
ating’, as we have called it, forms а 


: arity and with self-rating on ‘sense of 
humour’, Apparently then it is the stable, popular girl who is con- 


sidered to have a good sense of humour, and who considers hersclf 
to possess this quality; it is the unstable, unpopular girl who is 
lacking in sense of humour, and who knows herself to be defective 


е reader to judge to what extent a research 


-~ such as the present one may be said to have clarified the concept it 
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was designed to analyse. Clearly the results cannot be regarded as - 4 
in any sense final; different sets of tests, different populations, 1 
different culture patterns might easily produce results differing 
widely from those reported. Nevertheless, it is only through com- 
parisons of different experiments; carried out in different cons, 
ditions, that we can learn about the influence of those aspects - 

of the experimental situation over which we have no control. At — 
the very least, this research would seem to have confirmed the 
. Suspicion expressed by many writers that the concept of ‘s e oP a 
_ humour! is not a unitary one, but that we are dealing rather wi el 3 
a multitude of independent aspects which must be quantifie ап А. 

studied separately. This research marks the beginning of a taxo- . ~ 

. nomic study of the position of these various aspects within the 32 
gl Personality spacc. Зон x х z 
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Chapter Eight 


THE ORGANIZATION OF PERSONALITY 


LL through this book, the reader will have noted that implicitly 
Д: explicitly а stand has been taken оп a very important 

i theoretical issue. In the field of personality research, as in 
the more strictly experimental fields of perception and memory, 
there are two fundamental ways in which we can order our think- 
ing, and along which we can plan our experiments and construct 
our theories. The first of these is that of atomism or elementalism; 
the other is that of wholeness of gestalt—the organismic way. The 
great majority of psychologists who deal with the concept of 
personality either experimentally or clinically have adopted the 
organismic point of view; the present book, like its predecessor, 
takes an outspokenly atomistic, elementalistic point of view. Here 
and there, throughout this volume, we have taken up this argu- 
ment with respect to individual issues; now we must come to grips 
with it on a somewhat more fundamental level, 

First of all, let us see precisely what is meant by the ‘organismic’ 
approach in personality study. We cannot do better than to quote 
from a book which has put the wholistic point of view better and 
more clearly than any other, namely, the Assessment of Men (O.S.8. 
Staff, 1948). Apparently, it was ‘one of the noteworthy features of 
the O.S.S. assessment system that it recognized explicitly the neces- 
: sity of relating all observations to each other, not in a mechanical 
= way, but by an interpretive process aiming at the discovery of 

general patterns and action systems, and made this the guiding 
principle of all its operations’. Underlying this feature of O.S.S. 
work we have a clearly stated hypothesis: *Organismic assessment 
18 based on the hypothesis that a trained psychologist or psy- 
chiatrist, with a fund of additional facts at his disposal, is, today; 
capable of improving to a significant degree the accuracy 0 
mechanical prediction derived from test scores alone.’ Apparently; 
‘the elementalistic approach calls for accurate quantitative 
measurements oi partial, isolated processes, wheteas the organismi 
approach comes down to inaccurate estimations of total integrate 
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processes’. It follows from this that ‘for a short over-all assessment 
the interview is probably the best and only indispensable method 
we have’. In contrast, ‘the elementalistic method is abstract and 
unrealistic, since no attempt is made to reproduce the conditions 
under which the men will eventually perform. . .. In adopting 
this method . . . the psychologist makes a radical subjective judg- 
ment at the very start by electing constituent processes, testing for 
these separately, and then adding the scores to arrive at a final 
raüng. He does this even though he knows that in actual life the 
mind does not add sequences of elementary processes to produce 
results, but organizes them into effective forms.’ > 


Clearly, the organismic point of view is а tenable one; there are " 


no internal contradictions which would make the system of beliefs 
briefly presented here illogical and mutually contradictory. Nor 
can the elementaristic point of view be dismissed on logical grounds. 
It follows that we must turn to the experimental evidence to decide 
between these two contradictory systems. Such evidence may be 
sought at three levels: (1) The level of total assessment, as in the 
O.S.S. programme itself; (2) an intermediate level, as in the 
case of Rorschach test interpretation along statistical or intuitive 
lines; (3) a fundamental level, as in perceptual organization, for 
instance. 

Г At the level of total assessment, we have already quoted very 
briefly the Michigan study (Kelly, 1949, 1947; Kelly and Fiske, 
1950). ‘The large number of subjects studied, the very large number 
as techniques of measurement and assessment used, and the excel- 
lence of the experimental design would have made the^findings 
from this programme outstanding, even if they had been less 
revolutionary in their import. From every point of view, the final 
results are a devastating comment on and criticism of the clinical 
methods of interviewing and projective testing current in most 
applied personality work. It was found that the most efficient 
Clinical predictions in terms of both validity and economy of data 
are those based only on the matters co 
file and in the objective test profiles. 5 4 
graphical and projective test data appears to have contribute 

ttle or nothing to the validities of the assessment rating. Neither 
the initial nor the intensive interview made апу apparent con- 
tribution, In fact, the predictions based on the credentials and 
Objective tests are- better than those made at thé end of ыг pro- 
Sramme on the basis of test procedures and observations: This 


The addition of autobio- 


у 


ntained in the credentials · 


, 


^ 
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"consistent trend would seem to be all the more significant in view 
of the fact that assessment staff members tended to be uniformly 
of the opinion that the interview contributed most to their unto 
standing of the case’, followed by either the projective test or an 
autobiography' (Eysenck, 1951). . 
Kelly and Fiske (1950) point out that ‘predictions based on 
individual projective tests as well as those based on an integration 
` of data from all four projective tests yielded relatively low correla- 
tions with the rated criteria. Scores from a single objective test, 
obtainable by mail at little cost predicted each of several criteria 
as well as all the clinical judgments made in the entire assessment 
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.. programme.’ They go on to say that ‘many who have seen our 


- results have been disturbed by the findings regarding the validity 
for this selection problem of specific techniques, which are felt by 
many professional psychologists to have a high degree of face 

. validity (or is it faith validity?}. Thus, it was the firm conviction 


-..: of the staff of the O.S.S. Assessment Programme that the global 


evaluation of a person permits much more accurate predictions of 
his future performance than can possibly be achieved by a more 


T MR approach. . . . Our own findings to date serve to raise 
y 
1 = 


: _ aristic approach h: 


bts concerning the v 


Although the unstructure 
© tools in personnel selecti 


alidity of this- general proposition. . · * 
d interview is one of the most widely use 
on, the writers know of no evidence in the 
literature to suggest that such interviews have other than extremely 
low validity, which hardly justifies the degree of confidence and 
У are held by users of the interview. Опе 
is most disconcerting to us: the inverse 
€ confidence of staff members at the time is 
nd the measured validity of that prediction. 


: he only study hitherto carried out in such 2 
Way as to permit a comparison of the 


aspect Of our findings 
relationship between th 
making a prediction a 


as completely failed to support the point of view 
c psychologists, although the psychiatrists ап! 

clinical psychologists who were actually carrying out the procedures 
and made the predictions were very largely supporters of this point 
of view. The elementaristic method of adding scores along well- 
known lines of multiple prediction is shown to be conspicuously 
superior to the organismic method of organizing data into a mean- 
ingful dynamic pattern for cach subject. This finding docs not, 0 
course, constitu:e а universally valid disproof. of the organism 
position, but clearly the onus of proof is now with the organ- 


of the organismi 


organismic and the element- 


Е 


THE ORGANIZATION OF PERSONALITY 279 


ismicists, and it may be added that such proof must be experi-- 
mental in kind—semantic arguments from supposedly higher 
general principles are not sufficient to controvert empirical findings. 
E и second level, comparatively Tittle research has been 
еза Mes the small-scale experiment comparing statistical and - 
MEA н ae procedures with respect to the Rorschach test, 
oig Ө ined on page 163 of this book, may serve as an _ 
К E тае 'This neglect of a very promising approach is 
Jewels 3 few psychologists have the facilities needed fora large- | 
on quiry such as would be required at the highest level of 
plexity, but most research workers would find it very easy to — 


sefül. ^ 


Set up an experiment at this level of complexity. Much us 


information could be gained in this way regarding the efficacy of : 


> ote 


Зэв шар approaches; as far аз our data are concerned the out- 

of the experiment is again hostile to the organismic approach. 

bs at the lowest level of complexity, however, in connection 
€ processes of perception or memory, that gestalt write: 


ha Ever ume : 
ave been able to support the organismic view most strorigly, and 


it is at this level that their view has achieved its greatest success. 
planned to test the пуа 


р onsequently, experimentation specially 
pru in this field also is of particu np 
wid the organismic hypothesis fails here too; then we may say 
iar pa degree of confidence that it is unlikely to stand up to 
there experimentation in more complex fields. We turn next, 
little oe to an experiment which superficially may seem to have 
light о do with the study of personality, but which throws much 
mod on the principles underlying the construction of psyciiological 
( odels of organizalion. This experiment, reported by Granger 
(5239), deals with asthetic appreciation; it covers a field, there- a 
Ж 4 Which saw the original gestalt hypotheses grow (as in the 
mork of Ehrenfels), and which is sti 
ress into which to retire for a last stand by many organismieists.. 
Granger took his stimuli from the Munsell Colour System 
(1921), which is organized around three dimensions: Value (more 
usually called lightness, i.c. the scale of greys from white to black); 
toma (more usually called saturation); and Hue (i.e. the special 
Ешь attribute which distinguishes red from blue, or green 
9m yellow). Each dimension is intended to form a scale of per- 
Ceptually equidistant steps. Three sets of tests were made up: 
0 1) Hue tests, of which there were 24; made up insuch a way as to 
old value and chroma constant within each test. As far as possible, 
y А | 
эр us Я 


о 


к 


+? 
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lar interest and import- ж 
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all ten hues were sampled in each case at all levels of the colour 
solid. (2) Value tests, of which there were 24, made up in such a 
way as to hold hue and chroma constant. (3) Chroma tests, of 
which there were 11, made up in such a way as to hold hue and 
value constant. The coloured chips which constitute the stimuli 
for each of these tests were presented to 25 male and 25 female 
subjects, screened for colour-blindness by means of three standard 
tests, under carefully controlled conditions against a medium grey 
background, with instructions to rank them in order of preference. 

In the statistical treatment, Granger made use of Kendall's 
(1948) coefficient of concordance (‘W’), which for all practical 
purposes may be considered equivalent to an average intercorrela- 
tion between the rankings of a number of subjects. All the 120 
W-values calculated (for men and women separately, and for all 
the tests) are positive, thus showing that there is general agreement 


among the subjects on the raak-order of the stimuli. W-values | 


average around -5 for hue, -2 for value, and -4 for chroma. It is 
interestiag to note that on an additional test, containing ten fully 
saturated hues, the W-values for men and women were much 
smaller than for other hue tests; this is presumably due to the fact 
that whereas the colours in the other hue sets differed only in 
respect of hue, those in the fully saturated set differed also in value 
and chroma. This decrease in agreement with increase in number 
of dimensions is of course a general proposition of dimensional 
analysis, but its empirical confirmation may be of some interest- 
' Not only do individuals agree with respect to their rankings 0 
the stimùli on each of the 6o tests, but it can also be shown that the 
hue tests correlate with each other (W = :84), the value tests 
correlate with each other (W = -44), and the chroma tests cor- 
relate with each other (W = -44). Further than that, when each 
subject is given a score in terms of his correlation with the general 
order for the hue, value, and chroma tests, it can be shown that 
these rankings agree to the extent of W = :64, so that a subject 
who has a high score on one set of tests will also tend to have а 
high score on the other sets of tests. A last important demonstration 


is that the order of preference for all three attributes of colour remains — 


invariant from level to level, throughout the colour solid. 

. These preference judgments would seem to have a firm objec- 
tive basis in certain properties of the stimulus objects. To take but 
one example, namely, hue, Granger has shown: that there exists 2 
correlation of -81 between the order of preference and the dominant 
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* Wave к 
Бэрэн: © the colours, їп the sense that the hues of shorter 
We Eave c preferred to those of longer wavelength. 
Mes pisc very briefly, some of the facts regarding the 
a е on of the simplest possible stimulis, viz. a single 
DE n these results to forecast appreciation of colour | 
According iiis ere we come against our first crucial problem. 
3 would expect A atomistic or clementaristic type of analysis, we 
1 ences ОСО] Цэ following considerations to determine prefer- 
абыя лээн ho ЖОЕ (1) Preferences for the individual 
| wo Ойл. : 2 a the combination. If we took combinations of 
resulting dina had these ranked, then we should expect the 
саа ши to show substantial agreement with a ranking 
for the single шр y on the basis of the summed preference ratings 
те аца 8: ours making up cach dyad. (2) Relations between 
Circles we rni Чаш in the combination. Thus if we take the hue 
tolus incre n that liking for a given combination of two 
increases, This 5 " the distance between them on the colour circle 
‘additive’ ral uem also could easily be translated into a numerical, 
сеоба е bs ich would specify exactly in each case which of 
care "is s should be preferred. (3) Specific and error factors. 
reliabilities eed to find perfect test-retest correlations; indeed, 
Ateord: 9 simple preference tests are seldom above 70 48. 4 
Considered i n the gestalt hypothesis, these notions would Беж 
: ject to criticism. Preferences for colour combina- 


tions ar 
are 5 af 5 Е E 285 
1 emergent qualities, essentially unlike the constituent qualities 

and impossible to 


which d 5 
| БЕСС “нарын preferences for single colours, 
om any knowledge of single colour preferencessor from 


а 


апу quantitati : 
quantitative relations obtaining between the colours.* 
experimental model 


mo ds. we would appear to have an 1 
ХЕ ла m expected to produce quite clear evidence for or 
COND HMM olite approach. Predic 

(a) or eem is in principle possible 

| objectiv nces of the single colours, 
/ ӨЁЧЬ e relations obtaining between 
€ colour solid; that is the bald stateme 

to maintain this gestalt 
who maintained that preferences for 


tion of preferences for colour 
on the basis of knowledge of 
and (9) knowledge of the 
them along the dimensions 
nt of the atomistic 


1 It в. 5 4 : 
is interesting to note that the first psychologist 


view 
caes the arch atomist Wundt (1910), 
combinations would be determined by а *Totalgefühl', ап emergent 


confi = Е 

ааа with unique properties over and above those of the component 

with th; ре (1909)‘and Titchener (1912) held a contrary view, more in line 
hat advanced here. : 
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position. Prediction based on these principles is impossible—what _ 
is involved are emergent qualities which cannot be reduced id 
known properties of the stimuli considered apart; that is the d 
statement of the organismic position. We must now turn to the 
experimental evidence. . EP". © А 
The first demonstration relates to the question of relation 
between stimuli, i.e. the hypothesis that colour combinations аге _ 
preferred in terms of the distance between the component colours — 
on the colour circle. The correlation, when a standard colour was 
presented together with a variable colour, was found to vary from 
:93 to unity in different experiments, so that we may assume that 
the hypothesis is essentially correct. (In a similar experiment by 
Clarkson et al. (1950), correlations 08-06 and -99 were observed, 
thus lending support to this generalization.)! Orders of preference 
for colour combinations correlated highly from subject to subject; 
for hues, values, and chroma. УУ coefficients average around :6 to 
`7, thus showing marked agreement. Not only do individuals agree 
with respect to their rankings of the stimuli on each of the tests, but 
it can also be shown that the hue tests correlate with each other 
(W = -26), the value tests correlate with each other (W = :46), and | 
the chroma tests correlate with each other (W = :37). And lastly; 
the rankings of the subjects on the hue, value, and chroma tests 


3 correlate to the extent of W = ‘64, i.e. to an extent identical with 
HU o ( 


£ 


found when single colour stimuli were used. : 

When scores are calcülated for each subject on the single colour 
rankings (combining scores on the hues, values, айа chromas tests); 
and cozrelated with similar scores on the combined colours rank- 
ings (again taking into account hues, values, and chromas) %/© 
_ find a correlation of -826. In other words, those subjects who agree 
most with the average order when ranking single colours also agree 
most with the average order when ranking colour combinations. 
This very high correlation indicates very strongly that the laws 
governing one type of ranking are essentially identical with those 
governing the other, thus arguing against the gestalt idea О 


. emergent properties. 


— . Wenow come to the main test of the hypothesis. Twenty colour 


combinations were ranked in order of preference, as were also the 
. л А 

2 We are discussing here only hue combinations, but similarly high corr B 
tions were found for value (-94) and chroma (00) tests, indicating that рге ES 
ences for these typ2s of combinations are also directly Gependent on object! 
relation between component colours along their respective dimensions. 
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individual colours making up the combinations. Then correlations 
were run between the observed ranking of the colour combina- 
tions (O), the hypothetical ranking derived entirely from the pre- 
ference judgments for the component colours (C), and the hypo- 
thetical ranking derived entirely from the colour intervals of the 
components (1). The correlation between О and С is 681, and 
that between O and I is :554: C and I are of course uncorrelated 
(т = соо4). When С and 1 are combined, they predict the final 
ranking to an extent indicated by a correlation of :84. When 
corrections for reliability are introduced it can be shown that the 
predictive accuracy of the combination of C and I is not far short 
of 100 per cent. Thus the final preference judgments of colour ' 
combinations are determined entirely by thé elements entering | 
into the combination, and the relations between these elements as 
determined by simple algebraic addition. It is difficult to see how 
the gestalt hypothesis could be invalidated more conclusively. ` 

‚ We can, however, take one further step. Even colour combina- 
tions may be said to be at a relatively low level of complexity. 
What would happen if we correlated score on the colour ranking 
test with score on some much more complex test of art apprecia- 
ton, such as the Maitland Graves Design Judgment test (1948); in 
which judgments have to be made between good and poor designs 
of considerable complexity? This test, which has been shown ine 
have considerable validity and good reliability, is entirely in 
and white, so that colour plays no part in it. It is easy to see that 
on the gestalt hypothesis no positive prediction could be made; 

ere again those elusive emergent properties would prevent any 
application of knowledge gained from less complex stimuli. If we | 
take our stand, however, on the demonstration ofa general factor 
of esthetic judgment (Eysenck, 1940), 4 demonstration based 
entirely on an atomistic view, then we would predict a sizeable cor- 
relation between these variables. In fact, correlations 08:59 and 
773 Were observed when the Maitland Graves test was correlated 
with results from the single colour test and the colour combination 
test. (It should be noted that these correlations are not due to _ 
piBerences in intelligence; only very small correlations Were found _ 
etween intelligence and the colour ranking tests.) Here Байт 4 

еп, the atomistic view is supported БУ the test results, ac 
Sestalt hypothesis discredited. | , 
to M in connectica with previous 

ply that this particular study has onc 
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theoretical conflict between associationism and gestalt. We have 
reported it in some detail because it is only by constantly bringing 
these complex theoretical issues into direct contact with experi- 
mental data that we can hope to arrive at a meaningful and 
scientifically worth-while conclusion. If those who follow the 
organismic line of argument would similarly subject their concepts 
and arguments to empirical testing then we might see a consider- 
able growth in the area of agrcement between different schools of 
psychology, and an abatement in the purely semantic discussion 
which so often passes for theorizing. As in so many other arcas of 
social psychology, *the same old concepts and words are tumbling 
around in the same old empty drum to the resounding echo of 
excessive verbalization' (Eysenck, 19514). ' 
How, then, it may be asked, does the atomistic psychologist 
view the problem of personality organization? We may perhaps 


attempt to answer this question by reference to an urgent psy- . 


chiatric problem, namely, that of diagnosis. Reference has already 
been made in an earlier chapter to the conflicting views, and the 
often self-contradictory pronouncements, of psychiatrists and 
clinical psychologists. In their terminology, they regard the patients 
under their care as falling into one of a number of different classes 
(hysteria, schizophrenia, neurasthenia, etc.), a view which is only 
permissible if these different disorders are regarded as qualitatively 
different; yet in much of-their writing and even more so in their 
practice they regard these different classes as blending into one 
another, as being mutually overlapping, as being in fact differ- 
entiate® along quantitative lines. Clearly the qualitative and the 
quantitative aspects must be reconciled in some way, but little 
guidance is to be found in the text-books. 

The answer to this problem is implicit in the experiments 
described in the previous chapters of this book: we must determine 
the required number of dimensions, locate them accurately, and 


measure them with a given degree of reliability and validity. 


Figure 37 may serve as a very rudimentary model of the kind of 
structure we have in mind. Using our experimentally demonstrate 

three factors of neuroticism, psychoticism, and extraversion-intro- 
version as three axes of a co-ordinate system, we can now locate 2 
given patient in terms of his exact position within this syste?! 
Leaving out of account the extravert dimension for the moment, 
we can see thatthe average person would lie in the centre of the 
diagram, at A; a strongly psychotic person, undifferentiated wit 


== 
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respect to neuroticism, would be located at P, while а strongly 
neurotic patient, undifferentiated with respect to psychoticism, 
would be located at N. A person suffering from both psychotic 
and neurotic disorders would be found at point P + N. In terms . 
of this diagram, the question: ‘Is this person psychotic or neurotic?” 
хихи аз unreasonable as the question: ‘Is this patient intelligent 
еш Two orthogonal vectors, like neuroticism and psycho- 
ism, generate a plane on which the position of an individual has 

to be indicated by reference to both vectors; we can only describe an 
PSYCHOTICISM REN 
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ТНЕ ORGANISATION ОР PERSONALITY : 
EXTRAVERSION  THREE-DIMENSIONAL REPRESENTATION 
Figure 37, 

individual by giving both his I.Q. and his height, or by giving both 
his degree of neuroticism and of psychoticism. All positions on the 
plane thus generated are possible locations for a given individual, 
and it will be seen that mixed cases are far more likely than pure 
¢ases—we are more likely to find individuals in the plane of the 
diagram than on the ordinate or on the abscissa. This preponder- 
ance of mixed cases of course agrees well with clinical experience. 
Diagnosis on this showing should consist in the accurate determin- 
ations of an individual’s position on the plane, rather than, asis now 

usual, in a simple either-or judgment (Eysenck, 19515). 
It is not hypotnesized, of course, that the three dimensions 
dealt with in this book are the only ones into which personality can 
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be analysed, and along which measurement should take place. To © 


take but one example, there is the case of intelligence, operation- 
ally defined in terms of Thurstone’s second-order factor, which is 
approximately orthogonal to all the dimensions so far discussed. 
In due course, other dimensions will no doubt be isolated and 
measured, and much prospecting has already been done by Cattell 
(1946) into possible lines of progress. But regardless of the actual 
number of independent dimensions which our picture of personality 
may require, it is clear that categorical diagnosis of the ‘either-or" 
kind are not warranted by the experimental findings; what is 
required is a separate assessment and measurement of each dimen- 
sion in turn. It is not claimed that more than a beginning has been 
made in this complex, time-consuming, and difficult proceeding; 


ете, however, that results to date аге fully in agreement 
with the general model of personality on which our procedures | 


have been predicated. 

“We have then a general outline into which to fit such facts as 
future research may discover. It is clearly a static outline, in the 
same sense in which the periodic system of the elements is static; 
however, movement can easily be introduced into this outline 
when developmental facts can be experimentally controlled. Аз 
examples we may point to the studies on identical twins and on the 
after-effects of leucotomy summarized in earlier chapters. 

The construction of stich a general frame-work, of course, does 
not release us from the obligation to investizate separately the 
various factors which go to make up the structure, and to advance 
psychological hypotheses regarding their nature. Similarly, carefu 
experimental studies are called for to investigate individual tests 
known to have high saturations for any of the factors postulated. 
Such work, however, must follow the establishment of the main 
dimensions of personality; it cannot precede it. Consequently, We 
have little to add on these counts; it must be left to future work t° 
fill in these lacune. 

З Еуеп so, however, certain theoretical speculations may be of 
some interest, and they are advanced here with the explicit wat” 
ing that much experimental work will be required before we Сай 
assess their value. These speculations relate particularly to the 
neuroticism factor—possibly because more is known empirically 
regarding this factor than about any other; they are based 01 
certain obvious‘analogies between neurotic illness and such physic? 
events as, for instance, the fracture of metals—analogies which 
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-have passed into the very way in which we talk about neurotic 
‘breakdown’, being under ‘stress’, reaching ‘breaking-point’, or 
having a ‘brittle’ personality. This analogy can be pushed a good 
deal further, and as long as it is recognized as an analogy and 
nothing more than that, it may throw some interesting light on our 
explorations of those weaknesses in personality which give тїзє to 
neurotic break-downs. a o. ` * 

_ 1 may come as a shock to many psychologists not well versed 
in physical theory and knowledge that present-day knowledge of 
fractures of metals and their causes is far from satisfactory, and 
that there are various theories in the field none of which has as yet 
succeeded in providing direct experimental support for its*con- 
tentions. Again, as in psychology, we have the historical and the 


ahistorical approach, as well as the controversy between bee 


ents of a ‘dynamic’ and those of a ‘static? method of analysis. 
brief description of current physical thought on this subject may 
provide a frame-work for our subsequent discussion. ‘ жы ee 
"The basic scientific problem of the general conditions whic 
determine the ductile and brittle failure of metals is still largely 
unsolved’ (Sachs, 1948). Two main avenues of approach are 
being used to investigate these phenomena. The micro-mechanical 
method is being used to study the effect of various factors such as 
lattice imperfections, and the formation, reorientation, and growth 
of cracks. The phenomenonological method studies the general macro- 
Scopic laws underlying fracture. ‘Eventually the two types of inves- 
tigations will be correlated to provide a single self-consistent theory 
or fracturing of metals’ (Dorn, 1948). Most of our knowledge 
Concerning the effect of stress on the fracture of metals has Бес 
obtained from the phenomenological approach. There is a EJ 
complement here with psychological work, particularly with the 


distinction drawn between the molar and the molecula dece 
he micro-mechanical method would find its E ue. Selye 


neurological work, or in the kind of work summari : 
under the revealing title of ‘Stress’ (1950). The ин 907 
method would find its analogue in the molar арргоас it a хэн 
by most psychologists nowadays. We may indicate this corresp 
€nce in the form of a Table: . - 
Psychological methodology: 
(1) Molecular method. 
(2) Molar method. 


p ^ 


Physical methodology: 


(1) Micro-mechaziical method. 
(2) Phenomenological method. 


. 
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It used to be assumed in physics that there were two general 
criteria for fracture: (1) Fracture occurs when a critical state of 
strain is achieved, and (2) fracture occurs when a critical state of 
stress is achieved. It is now accepted on the basis of recent investi- 
gations that the critical strain and critical stress laws for fracture 
are individually incorrect, and that we must follow Bridgman in 
postulating (3) that the stress level for fracture depends upon the 
entire stress and strain histories preceding fracture. Here we have 
another analogue with psychological theory, contrasting the his- 
torical with the constitutional point of view. And again the reso- 
lution is similar—a recognition of both elements, and an attempt 
to obtain quantitative estimates of the influence of both. 

A geometrical representation in many ways similar to that 
provided by factor analysis in psychology may render the position 
in the physical field a little clearer. It can be shown that ‘the state 
of stress in a homogeneous isotropic metal is completely defined by 
the three principal stresses. Therefore, the stress state at fracture 
can be represented by a stress surface in three-dimensional cartesian 
co-ordinate stress space as shown in Figure 38. Whenever the 
stresses are less than the values on the limiting surface, the metal 


Figure 38.— Fracture Surface in Stress Space 
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will not fr: 9 
ТАК o 
fore, will not lead to нб antl Боож. елкы 
point F Ч e 1 state of stress represented Бу 
ao on the fracture surface is reached.”* 
way to E Vide iet cae that this fracture surface is in no 
the previous stress or KP its shape and size are determined by 
П Ыга nies ain history of the metal, and alter with it. 
the stress or дойт) ше membras whose distortion is dictated by 
(rain fhapés of Ey y. For homogeneous isotropic metals, only 
ИКЕН | ч he stress surface are theoretically possible. The 
angles wi th t symmetrical about line OM which makes equal 
КАЙ mort : positive directions of the three principal stresses. 
ofthe у 7 де surface must be independent of the orientation 
fore! TES Ён ахез of the parts being stressed. There- 
surface ofa cam of the principal stresses representing the fracture 
otat ons of 532 isotropic nietal must be invariant relative 
This con. he Cosürdsnate axes in the past. 
chology b cept of “fracture surface’ has no equ . 
hetristi d E Er potentially it would appear to have considerable 
“Бус xm RE Presumably axes having the properties mentioned 
Оман, De located through factorial studies, based on controlled 
tient n hs in the case of human subjects, and based on expert- 
would ae case of animals. The total size of the fracture surface 
© ey be identifiable with our neuroticism factor. 
hecause ih scussion -so far may appear theoretical and remote 
ИИ a fact the metals we work with are neither homo~ 
Ё оно isotropic, and certainly in the psychological field 
eee у апа поп-һотодепену are more prevalent. However, 
Ш са vances from the simple to the complex, and what has 
LM ld so far may serve as a first approximation. We must now 
i Wi compas certain complicating factors. - ё s 
strict] 5 three axes along which we plotted the fracture surface are 
y arbitrary in their location, although conventionally one 


vel H а . 
ould presumably orient them in relation to direction of the 
me treatment is applied to 


applicati 
ын ROME of the stress. When the sat 
ropic material, certain positions of the axes become prefer- 


ivalent in psy- 


entially homogeneous and 


stress from grair. to grain or even 
ristalline metal are not known at 
vave neglected the 


а 
he metal under discussion has been taken as 658 


lsotropi 
foni pot because the detailed variations in 
Ёсе: nt to point in a.single grain of polycristé 
- Most studies on fracture in polycristalline metals Ё 
ture of the metals. 


complicati 
cation: Бүс " 
S.S. p. s arising from the crystalline struc : 
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able; these can be determined by the differential reaction to stress 
of different parts of the material, by differing historical conditions 
of different parts of the material, or both. Similarly, it is the exist- 
- ence of inhomogeneity in human behaviour which decides us to 
" prefer опе set of factor axes to another—although even then no 
‘absolute’ value attaches to such a position. 2 
“Опе important change in our picture which must accompany 
the shift from isotropic to anisotropic material is the inclusion of 
time as an additional dimension. It has been known for a long 
time that brittle solids may support a static stress for long periods 
of time without the slightest evidence of approaching failure, which 
then occurs with great suddenness. Experimental evidence shows 
- that there is a definite relation between the intensity of the stress 
to which the brittle solid is subjected, and its duration necessary to 
produce a fracture. It was found that ‘the duration of the stress 
required to obtain the failure ‘decreased very sharply as the inten- 
_Sity of the applied stresses was increased’ (Poncelet, 1948). Г 
This phenomenon of ‘static fatigue’ cannot be reconciled with 
any of the classical theories of strength of brittle solids, which do 
“not take the duration factor into consideration. The first step 
towards a solution of this dilemma was taken by Inglis (1913); 
whose observation that scratches or cracks reduce the strength of 
hardened steel plates, through the considerable intensification of 
the applied stresses at the crack tips, was taken up by Griffith 
(1921, 1924) in his famous ‘crack’ theory. According to this view; 
he notorious weakness of brittle solids to tensile stresses is attributed 


^w 


to the existence of indefinite ‘flaws’ in the solids; these flaws he 


believed to be in essence submicroscopic cracks. He assumed the 
structure to be continuous and developed his views along thermo- 
dynamic lines. ‘This new approach . . . introduced the factor 
time in the expression which determines the breaking stress . + * 
of brittle solids, and gives an intelligible interpretation of the 
phenomenon of static fatigue’ (Poncelet, 1948). It also explains 
why the average stress required for the fracture of solids is many 
times smaller than that calculated from the forces acting betwee” 
the atoms. 


How is all this relevant to psychology? The application is sur- 
1 This creates a difficulty regarding the explanation of these flaws which 


can be sidestepped by assuming the structure of the solids to be particulate, ар! 


5 Y f 
composed of ions. In this way the phenomenon is brought into the fidd 
general stati"tical mechanics. 
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prisingly direct. Let us recall for a moment Thomson’s so-called ч 
‘sampling theory’ of intelligence. He explains the observed pattern 
of ifitercorrelations between intelligence tests in terms of bonds. - 
‘Each test calls upon a sample of the bonds which the mind can form . 
. . some of these bonds are common to two tests and cause their. ~ 
correlation.) While not venturing to suggest in detail what the - 
bonds might be, Thomson believes that ‘they are fairly certainly 
associated with the neurones or nerve cells of our brains, of which 
there are approximately one hundred thousand million in each . 
normal brain’. We may easily consider a theoretical. equivalent 
to Thomson's cognitive hypothesis in the affective and conative 
sphere by postulating that the observed pattern of intercorrelations , 
between neuroticism tests can be explained in terms of faults. Each. 
test calls upon a sample of the faults which are inherent in the mind. + 
.. . some of these faults are common to two tests and cause their ` 
correlation. No neurological locus will be suggested for these hypo- 
thetical faults, although it would not be difficult to find many _ 
possible claimants, particularly in the autonomic sphere: Here, A" 
then, we have a concept directly related to our previous discussion, 
because clearly these hypothetical faults are strictly analogous to 
Griffith’s submicroscopic cracks. y 
Certain other similarities may be stress 
almost complete failure of classical theories f 
results. There is little cause to document this statement with тезресї 
to psychology; in the physical field we may quote Slater's States 
ment that *we have been upset in modern times that materials eis 
eminently “ductile” as mild steel, as revealed by our oe c 
tensile test, have proved so undeniably and catastrophically britt i 
on many momentous occasions' (1948): Indeed, Ko р, 11 
classical theory is perhaps morc noticeable in the p us ; 
because nature calls the scientist's bluff more clearly and de A У. ON 
there than it does in the mental field, The ship sinks; ee e 
breaks; the steel tower topples—these are cvents which pag ur E 
explained away. But the patient who does PET. dat. ын 
psychoanalytic therapy, or who does not react to ^ ^ -+ P 


diction of combat success or academic achievement u^. aoe 
borne out in fact; the clinical diagnosis that is ice y o 
developments—these can be argued away Hia AE e 29 
fashion, Physics lives in the world of the reality oe Sei ЫГ 
chology and psychiatry have not altogether emerged 10 


world of the pleasure principle. M WE Nas 6771 


> 


ed. One of these is the 
to predict emp rical 
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Yet some at least of the failures of the psychologist must be laid 
at the door of those who determine the problems which he is to be 
set, and the conditions under which he shall attempt to solve them. 
It is quite customary to ask the psychologist to predict success of a 
group of people without specifying the exact nature of the situation 
in which success or failure will be decided. Thus the subjects who 
passed successfully through the O.S.S. assessment programme 
might be detailed to do one of a large number of very dissimilar 
jobs, ranging all the way from putting on propaganda shows in 
oriental countries to being dropped as secret agents into enemy- 
occupied territory. This situation would find an analogue in the 
physicist's being asked which of a large number of metal bars were 
likely to fracture, but without his being given any information as 
to the precise stress which would be applied to each of these bars. 
His ‘prediction’, if indeed he agreed to furnish one under such 
impossible conditions, would amount to little more than a state- 
ment of the present strength of the various bars. The success of his 
prediction would depend far more on the conditions to which 
these specimens would be exposed later, than on the accuracy of 
his measurement. 

Again, while the physicist's failures are spectacular, they do 
furnish a criterion which does not admit of argument. The psy- 
chologist's criterion is only too often unreliable, illogical, ап 

determined by extraneous considerations; low predictive accuracy 
is more often due to faults in the criterion than to faults in the 
predicting variables. What is worse, administrative action often 
interf&es with his attempts to establish proper controls, so that it 
becomes quite impossible to evaluate properly the results of his pro- 
cedures. Incomparing forecasting accuracy of physical and psycho" 
logical tests, these considerations must always be borne in mind. 

Similar considerations apply to the situation in which the psY~ 
chologist is called in to ‘advise’ on certain problems. He is usually 
expected to give his advice on the basis of general principles, а 
to do so more or less on the spot. Even in physics, where general 
principles have been established to a much greater extent than 1” 
Psychology, such a demand would be deprecated, and an actu? 
experimental investigation of the applicability of such principles 
to the particular situation insisted on. The psychologist, howeveb 
is seldom given the opportunity of carrying out such investigation? 
although in his case they are even more viial; he is expect’ 
instead, to advise on problems of the utmost complexity оп 109 
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basis of such very limited data as may be available—indeed, he must | 
count himself lucky if there are any data at all. Until those who 
pose the questions gain some insight into the methods of science, 
and learn to provide suitable conditions for an experimental deter- 
mination of the answer, psychology will continue to operate at a 
low level of scientific respectability. 

Given the existing external circumstances, however, the psy- 
chologist can make a maximum contribution by (а) isolating the 
main dimensions of personality, and (b) improving the accuracy 
with which they can be measured. It is with these two aims in view 
that the experiments described in this book have been carried out. 
To what extent may they be said to have been achieved? The 
isolation of three main dimensions—neuroticism, psychoticism, 
extraversion—may be said to have been accomplished at a reason- 
ably high level of confidence, although it would seem most desir- 
able that all of this work should “be repeated independently at 
other institutions; it is unfortunate that psychology has not adopted 
the practice widespread in the physical sciences where new claims 
are routinely tested by several institutions. Only in this way can we 
hope to build up a stock of verified trustworthy facts and principles; 
the single experiment may be of outstanding interest, but it 
can never be definitive. At the same time this procedure ensures 
that one of the most important of scientific requirements is met, 
namely reproducability of experimental "conditions. Identifiability 
of groups, of sampling, of experimental procedure; repeatability of 
diagnoses, of tests, of treatments—all these must be ensured if az 
experiment is to be accepted as being of scientific value. And there 
is no better proof of reproducability of conditions than reproduc- 
tion of results, although the inverse of this relationship does not of 
course hold. Subject, then, to such repetition we may claim to have 
made a contribution to our first task. гіз 4 

Regarding the actual measurement of these dimensions, we 


have shown in the text that neuroticism can be measured with a 
f about :80: these figures 


, and are certainly 
en for the other two 


E 


inherently less exact or less reli 
Warnings which long experience W 
desirable in order to avoid disappointment- 


> 
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In the first place, while on most of the groups studied ee 
the same tests are found to correlate together to define a given - 
factor—neuroticism, say—it is quite common to find that ү pr 
particular group one particular test will fall out of ШЕ, Eo 
examples of this tendency have been mentioned already in M 
text. Thus for middle-class groups the Crown Word Connection 
List has a high saturation with neuroticism, and low come : 
with intelligence; for a group ofunskilled workers studied by На 
(1951) this test was found, however, to have only relatively o 
saturations with neuroticism, but high saturations with шеш ЕД 
This change can of course be explained in terms of different stag С 
ards of literacy, nevertheless it indicates that results from опе grouk 
cannot immediately be extended to other groups differing хуц 
the original one with respect to intelligence, social status, we 
other important ways. Again, while for groups of normal pei. 
ligence the persistence test usually has good saturations M. 
neurotics, it has been found quite consistently (Brady, 194 T 
O'Connor, 1951) that in mental defectives there is hardly any cor- 
relation between persistence and neuroticism. No obvious explana 
tion of this difference suggests itself, and a special experiment 
would be required to study this discrepancy. Indeed, it is quite 
likely that a great deal may be learned about the nature of the 
"factors involved by the careful study of such discrepant results. 

? А second difficulty which should not be underrated is related 
to the problem of test standardization. We have ample evidence 
hat even slight changes in the test may produce profound changes 
in the results. Thus changing the method of recording sway on the 
body sway test, or changing the record on which the suggestion (0 
‘sway forward’ is made, may have a profound influence on the 
results obtained. Whether the subject can see the apparatus К 
entry or not, and whether he is frightened or not; the degree 0 
reassurance given to him, and the reasons advanced for making 
him submit to being tested at all; whether the tester is a prets 
young girl, a dominecring male, or a mature, sympathetic мохи 
—all these influences may, and in some cases do, affect a personae 
scores. They are difficult to control, particularly as the We. 
stimulus—a pretty young girl as tester, for instance—may me is 
quite different things, and arouse quite different emotions, in E 

~ jects differing in age, sex, and marital happiness. Standardizat 
is not impossib'e, but it is much more difficult with tests of t е 
type than it is with tests of intellectual capacity. When what we 2 
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trying to measure are those somewhat intangible things—ego in- 
volvements, motivation, strength of drives—then we cannot in 
any simple way control the physical properties of our stimuli and 
assume equivalence. Here again, differences in results from one 
experiment to another may at first be disheartening; if they are 
regarded less as experimental failures, but rather as leads to definite 
hypotheses, and stimuli to further experimentation devoted to a 
solution of the particular problem created by these inconsistencies, 
they may actually be of considerable use and importance. 

A third difficulty is connected with the one just mentioned. 
From many points of view, it would be desirable to keep tests 
identical from one experiment to another, so that cross-comparisons = 
can be made. But ifthat is done we miss our opportunity to improve 
existing tests, and make use of such knowledge as we may have 
acquired during the course of our experiments. These two require- 
ments have to be carefully balanced, and a decision is not always 
easy. Premature standardization may serve merely to crystallize 
poor and ineffective practices; constant changes may make all 
useful comparisons impossible. In the past we have laid greater 
emphasis on improvement and change; in the future we shall lay 
greater emphasis on standardization. But this problem cannot be 
settled by any facile generalization, and it is likely to arise when- 
ever a particular experiment is being devised. 

This point is closely related to the question of the amount of 
stress which the experimenter is to lay on the main two aspects of | 
his work. He may either use many rough-and-ready tests, in order, = 
to isolate the main dimensions along which he intends to work; or 
he may carry out careful and detailed experimentation on а small 
number of tests, in the hope of refining these sufficiently to derive 
meaningful scores from them. Ап example may make clear the 
difference between these two approaches. We have used in our 


work raw scores on tests such as the Body Sway Suggestibility test, 
correlate significantly 


or the fluency test, and shown that these tests : 1 
ings are interesting 


With the neuroticism factor. While such findin; : 
eave the investigator some- 


and important, they must nevertheless 1 1 : 
what dissatisfied. Performance on even such relatively simple tests 
is certainly highly complex, and it is difficult to interpret findings . 
Psychologically. Thus Bousfield and Sedgewick (1944) made a 
detailed study of fluency tests, and found that curves of output... 


could be described ‘by an exponential equation: N= G= em) 
where N is the number of words written up to 


> 


a given time, t; Cis 
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the upper limit that the curve approaches asymptotically, pre- 
sumably identifiable with the total supply of the kind of words 
called for by the instructions, while e is the base of the natural 
logarithms, and m measures the rate of depletion of the supply of 
words. These writers have also shown that the mathematically 
derived constants have psychological as well as mathematical 
meaning. When their subjects were given preliminary practice in 
naming automobiles the rate constant (m) increased, but not the 
supply of names constant (C). In another experiment, Bousfield 
& Sedgewick (1944) found that the theoretical supply of pleasant 
words was considerably larger than the supply of unpleasant words, 
but that the rate constant was the same. These findings make 
psychological sense, and suggest that this theory can be applied to 
individual differences in fluency. The number of words written in 
a given time is of course the raw score conventionally used as 4 
measure of an individual's flaency; C and m are analytical or 
derived scores which in conjunction with cach other determine the 
raw sccres. 

. If we accept this theory, we see immediately that intercorrcla- 
tions of raw scores, factor analyses of such intercorrelations, and 
correlations of raw scores with outside criteria, all cease to have 

г any interpretable meaning. Observed correlations may be due to 
the influence of the C constant, the m constant, or any combination 
of the two; they would: fluctuate in an unpredictable manner 
according to such variables as type of material, selected, or length 

of test. (Clearly the raw score on a short test depends тб оп М, 
on a lug test more on C.) In other words, analysis of raw scores 
would be psychologically meaningless, and give much less promis- 
ing results than might be got from analysis of the C and m scores- 

! Similar considerations apply to the body sway test of suggesti- 
bility. A psychological analysis of this test (Eysenck, 1947) showed 
that at least two separate factors determine the raw score: the 
strength of the ideomotor connections within the individual who 
18 being tested (his ‘aptitude’), and his ability to resist this tend- 
ency. His raw score, therefore, would be something in the nature 
of a ratio of the two. Now the hypothesis advanced to explain the 
correlation of suggestibility with neuroticism depends only on onc 
of the elements in this ratio, viz. the subject's ability to resist; there 
15 no reason to expect neurotics to differ from normals with respe? 
to their ‘aptitude’. Consequently, ‘aptitude’ siinply serves to Хе! 
à confusing and complicating influence on the score when we want 


мы >; DDR ee 
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to correlate neuroticism with ‘ability to resist suggestion’. However, 
as we cannot at the moment measure ‘aptitude’ separately, and 
thus derive a score for ‘ability to resist’ directly, we must remain 
content with the raw score, which will be considerably attenuated 
by the influence of ‘aptitude’. Work is proceeding at the moment 
to obtain separate measurements for the two parts of the ratio 
separately, but whatever the result of these endeavours, it must be 
clear that the raw score, in spite of its apparent simplicity, is in 
reality quite a complex resultant of several unrelated forces. 
What is true of these two tests is presumably true, to a greater 
or lesser degree, of all our tests. Although superficially simple, yet 
in reality they are quite complex, and the raw score may bear no 
very constant or high relation to any of the forces which determine 
it. This fact may explain the comparatively low correlations which 
are usually found between tests of neuroticism; we are correlating 
raw scores, each of which is the resultant of several forces of which 
only one may be relevant to our analysis. If we could carry our 
analysis further, and obtain direct measures of these relevant 
forces, the attenuation of our correlations would be much reduced, 
and presumably the size of the correlation would be increased. 
At the same time we might in this way obtain evidence regard- 
ing the contradictory findings which characterize so much of the 
literature on personality testing. Let us assume, for instance, that 
experimenter A finds that a test of fluency has a high saturation for 
intelligence, but а low saturation for neuroticism, while B reports 
that his test of fluency has low saturation for intelligence, but high 
saturation for neuroticism. These results obtained from ‘che raw 
scores appear contradictory and inexplicable. Now let us assume, 


as is quite likely, that the ‘total fund of relevant words’—C—is 
te of production — 


highly correlated with intelligence while the ‘ra ae 3 
m—is highly correlated with neuroticism. A short test would depen 
more on т than on С, and would thus be a test of neuroticism more 


than of intelligence. A rather longer test would depend more on с 
lligence rather than one 


than on т, and would thus be a test of inte x Я 
of neuroticism. Thus by simply уагу!п5 the length o Е ын ; 
experimenter B might obtain superficially discrepant results an 
A. By working with analytical scores, both experimenters e 
find i ions on intelligence, т on neuroticism, 
that C had high saturations deu жир! 


regardless of the length of test used. Simi d 
Course, apply to other variables, such as level of education, age; 


type of stimulus material, or sex; superficially discrepant results 


КШ 
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obtained from raw scores should be reconcilable in terms ofanalytic 
scores. 4 
The conclusion might be drawn from this discussion that the 
most urgent task in the psychology of personality is the detailed 
analysis of individual tests and techniques along these lines. This 
conclusion would, in our view, be as false as the opposite one that _ 
the isolation of the main dimensions must precede such detailed | 
analysis. In our view, both procedures must go hand in hand; 
promising tests are discovered in terms of their correlation with 
neuroticism, or psychoticism, or extraversion; they must then be — 
analysed in detail, purified, and the new analytical scores used in | 
- order to determine with even greater accuracy the relevant factor. - 
It is this interplay of statistical and experimental procedures which 
in our view will lead to important developments in the future. 
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